(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:




ATGCATCCCA TGCATCCCGA AAACCACGCC ATCGCCCGGA GCACGAGCAC CACTAATAAC 60

CCATCTCGCA GTCGGAGCAG CAGGATGTGG CTCCTGCCCG CCTGGCTGCT CCTCGTCCTG 120

GTGGCCAGCA ATGGCCTGCC AGCAGTCAGA GGCCAGTACC AATCGCCACG TATCATCGAG 180

CATCCCACGG ATCTGGTCGT TAAGAAGAAT GAACCCGCCA CGCTCAACTG CAAAGTGGAG 240

GGCAAGCCGG AACCCACCAT TGAGTGGTTT AAGGATGGCG AACCCGTCAG CACCAACGAA 300

AAGAAATCGC ACCGCGTCCA GTTCAAGGAC GGCGCCCTCT TCTTTTACAG GACAATGCAA 360

GGCAAGAAGG AGCAGGACGG CGGAGAGTAC TGGTGCGTGG CCAAGAACCG AGTGGGCCAG 420

GCCGTTAGTC GCCATGCCTC CCTCCAGATA GCTGTTTTGC GCGACGATTT TCGCGTGGAG 480

CCCAAAGACA CGCGAGTGGC CAAAGGCGAG ACGGCTCTGC TGGAGTGTGG GCCGCCCAAA 540

GGCATTCCAG AGCCAACGCT GATTTGGATA AAGGACGGCG TTCCCTTGGA CGACCTGAAA 600

GCCATGTCGT TTGGQGCCAG CTCCCGCGTT CGAATTGTGG ACGGTGGCAA CCTGCTGATC 660

AGCAATGTGG AGCCCATTGA TGAGGGCAAC TACAAGTGCA TTGCCCAGAA TCTGGTAGGC 720

ACCCGCGAGA GCAGCTATGC CAAGCTGATT GTCCAGGTCA AACCATACTT TATGAAGGAG 780



CCCAAGGATC AGGTGATGCT CTACGGCCAG ACAGCCACTT TCCACTGCTC AGTGGGCGGT 840 

GATCCGCCGC CGAAAGTGTT GTGGAAAAAG GAGGAGGGCA ATATTCCGGT GTCCAGAGCG 900 

CGAATCCTTC ACGACGAGAA AAGTTTAGAG ATATCCAACA TAACGCCCAC CGATGAGGGC 960 

ACCTATGTCT GCGAGGCACA CAACAATGTC GGTCAGATCA GCGCTAGGGC TTCTCTTATA 1020

GTCCACGCTC CGCCGAACTT TACGAAAAGA CCCAGTAACA AGAAAGTGGG ACTAAATGGG 1080 

GTTGTCCAAC TACCTTGCAT GGCCTCCGGA AACCCTCCGC CGTCTGTATT CTGGACCAAG 1140

GAAGGAGTAT CCACTCTTAT GTTCCCAAAT AGTTCGCACG GAAGGCAGTA TGTGGCTGCC 1200

GATGGAACTC TGCAGATTAC GGATGTGCGG CAGGAAGACG AAGGCTACTA TGTGTGTTCC 1260

GCTTTCAGTG TAGTCGATTC CTCTACAGTA CGGGTTTTCC TGCAAGTCAG CTCGGTAGAC 1320

GAGCGTCCAC CTCCGATTAT TCAAATCGGA CCTGCCAATC AAACACTGCC CAAGGGATCA 1380

GTTGCTACTT TACCCTGTCG GGCCACTGGA AATCCCAGTC CCCGTATCAA GTGGTTCCAC 1440

GATGGACATG CCGTACAAGC GGGCAATCGA TACAGCATCA TCCAAGGAAG CTCACTGAGA 1500 

GTCGATGACC TTCAACTAAG TGACTCTGGT ACCTACACCT GCACTGCATC TGGCGAACGA 1560 

GGAGAAACTT CCTGGGCTGC CACACTAACG GTGGAAAAAC CCGGTTCTAC ATCTCTTCAC 1620

CGGGCAGCTG ATCCTAGCAC TTATCCTGCT CCTCCAGGAA CACCTAAAGT CCTGAATGTC 1680 

AGTCGCACCA GCATTAGTCT TCGTTGGGCT AAAAGCCAAG AGAAACCCGG AGCTGTGGGC 1740

CCAATCATTG GATACACTGT AGAGTACTTC AGTCCGGATC TGCAAACTGG TTGGATTGTG 1800 

GCTGCCCATC GAGTCGGCGA CACTCAAGTC ACTATCTCGG GTCTCACTCC TGGCACTTCG 1860 

TATGTGTTCC TAGTTAGAGC TGAGAATACT CAGGGTATTT CTGTGCCTTC CGGCTTATCA 1920

AATGTTATTA AAACCATTGA GGCAGATTTC GATGCAGCTT CTGCCAATGA TTTGTCAGCA 1980 

GCTCGAACTT TGCTGACAGG AAAGTCGGTG GAGCTAATAG ATGCCTCGGC TATCAATGCT 2040

AGTGCCGTTA GACTTGAGTG GATGCTCCAC GTGAGCGCTG ATGAGAAATA CGTAGAGGGC 2100 

CTGCGCATAC ACTATAAGGA TGCCAGTGTA CCATCCGCAC AGTATCACTC GATCACTGTT 2160 

ATGGATGCCT CTGCAGAATC GTTTGTGGTG GGAAACCTTA AGAAGTACAC CAAGTATGAG 2220

TTCTTCCTAA CACCCTTTTT TGAGACAATT GAAGGACAGC CCAGTAACTC CAAGACAGCC 2280

CTCACCTATG AAGATGTTCC CTCCGCACCA CCGGATAACA TTCAGATTGG CATGTACAAC 2340

CAAACAGCCG GTTGGGTGCG TTGGACTCCG CCACCCTCCC AGCACCACAA TGGCAATTTG 2400

TATGGCTACA AGATTGAGGT CAGCGCCGGT AACACCATGA AGGTGCTGGC CAATATGACT 2460 

CTTAATGCTA CCACCACATC TGTGCTCCTA AATAACCTAA CCACCGGAGC TGTGTACAGC 2520 

GTGAGGTTGA ACTCCTTTAC CAAGGCAGGA GATGGACCTT ACTCCAAACC GATATCACTA 2580

TTCATGGACC CCACCCATCA TGTGCATCCG CCACGGGCAC ATCCAAGCGG CACCCATGAT 2640

GGGCGACATG AGGGACAGGA TCTCACGTAT CATAACAATG GCAACATACC ACCTGGCGAC 2700 

ATTAATCCCA CCACTCATAA AAAGACCACT GACTACCTAT CTGGACCGTG GCTAATGGTG 2760

CTGGTCTGCA TCGTTCTTCT AGTCCTGGTT ATTTCGGCGG CTATTTCGAT GGTCTACTTC 2820 

AAGCGCAAGC ATCAAATGAC CAAGGAATTG GGTCACTTAA GTGTGGTCAG TGACAACGAA 2880 

ATAACCGCAT TAAATATCAA TAGCAAAGAG AGCCTTTGGA TAGACCATCA TCGTGGATGG 2940

CGAACTGCCG ATACTGACAA AGACTCAGGA TTAAGCGAAT CGAAGCTACT ATCCCACGTT 3000

AACAGCAGTC AATCCAACTA CAATAACTCC GATGGAGGAA CCGATTATGC AGAAGTTGAC 3060

ACCCGTAACC TTACCACCTT CTACAATTGT CGCAAGAGCC CCGATAATCC CACGCCGTAC 3120

GCCACCACTA TGATCATTGG TACCTCTTCC AGTGAGACCT GCACCAAGAC AACATCTATA 3180

AGTGCCGATA AGGACTCGGG AACTCATTCG CCCTATTCTG ACGCATTTGC CGGTCAGGTG 3240

CCAGCGGTTC CTGTTGTCAA ATCCAACTAT CTTCAGTATC CGGTTGAACC GATCAACTGG 3300

TCAGAGTTTC TACCCCCGCC GCCAGAACAC CCACCTCCGT CTTCTACCTA TGGATACGCA 3360 

CAAGGATCTC CTGAATCTTC GCGGAAGAGC TCCAAAAGCG CAGGTTCCGG CATTTCTACA 3420

AATCAAAGCA TTCTGAACGC ATCCATACAC AGCAGCTCCT CGGGCGGCTT TTCAGCTTGG 3480

GGAGTATCGC CCCAATATGC TGTCGCCTGT CCACCGGAAA ACGTTTATAG CAATCCGCTG 3540

TCGGCAGTGG CTGGCGGCAC CCAGAACCGC TATCAGATAA CGCCCACAAA CCAACATCCG 3600 

CCACAGTTAC CGGCCTACTT TGCCACCACG GGTCCAGGAG GAGCTGTACC ACCCAACCAC 3660 

CTGCCATTTG CCACACAGCG TCATGCAGCC AGCGAGTACC AGGCTGGACT GAATGCAGCG 3720

CGATGTGCCC AAAGCCGCGC CTGCAACAGC TGCGATGCCT TGGCCACACC CTCGCCCATG 3780 

CAACCCCCAC CGCCAGTTCC CGTACCCGAG GGCTGGTACC AACCGGTGCA TCCCAATAGC 3840 

CACCCGATGC ACCCGACCTC CTCCAACCAC CAGATCTACC AGTGCTCCTC CGAGTGCTCG 3900 

GATCACTCGA GGAGCTCGCA GAGTCACAAG CGGCAGCTGC AGCTCGAGGA GCACGGCAGC 3960

AGTGCCAAAC AACGCGGAGG ACACCACCGT CGACGAGCCC CGGTGGTGCA GCCGTGCATG 4020

GAGAGCGAGA ACGAGAACAT GCTGGCGGAG TACGAGCAGC GCCAGTACAC CAGCGATTGC 4080
TGCAATAGCT CCCGCGAGGG CGACACCTGC TCCTGCAGCG AGGGATCCTG TCTTTACGCC 4140 
GAGGCGGGCG AGCCGGCGCC TCGTCAAATG ACTGCTAAGA ACACCTAA 4188 
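
The running count printed at the end of each row can be cross-checked against the bases actually read. The following is a minimal sketch of such a check, not part of the listing as filed. The function name `check_listing` and the `sample` text are illustrative assumptions; the two sample rows are the first 120 bases of SEQ ID NO: 1, with the second count deliberately split the way the extracted text often splits it.

```python
import re

VALID_BLOCK = re.compile(r"[ACGT]+")

def check_listing(text):
    """Join the base blocks of a listing and report rows whose printed count is off."""
    sequence, problems, total = [], [], 0
    for line in text.splitlines():
        tokens = line.split()
        blocks = [t.upper() for t in tokens if t.isalpha()]
        # Extraction often splits the count ("102 0"), so digit tokens are rejoined.
        count = "".join(t for t in tokens if t.isdigit())
        for block in blocks:
            if not VALID_BLOCK.fullmatch(block):
                problems.append(f"non-ACGT character in block {block}")
            sequence.append(block)
            total += len(block)
        if count and int(count) != total:
            problems.append(f"row ends at {count} but {total} bases were read")
    return "".join(sequence), problems

# Hypothetical input: first two rows of SEQ ID NO: 1.
sample = """\
ATGCATCCCA TGCATCCCGA AAACCACGCC ATCGCCCGGA GCACGAGCAC CACTAATAAC 60
CCATCTCGCA GTCGGAGCAG CAGGATGTGG CTCCTGCCCG CCTGGCTGCT CCTCGTCCTG 12 0
"""
sequence, problems = check_listing(sample)
print(len(sequence), problems)
```

A block containing a character outside A, C, G and T is still counted toward the running total, so a single misread character is reported once rather than also shifting every later row count.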

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met His Pro Met His Pro Glu Asn His Ala Ile Ala Arg Ser Thr Ser
1 5 10 15

Thr Thr Asn Asn Pro Ser Arg Ser Arg Ser Ser Arg Met Trp Leu Leu
20 25 30

Pro Ala Trp Leu Leu Leu Val Leu Val Ala Ser Asn Gly Leu Pro Ala
35 40 45

Val Arg Gly Gln Tyr Gln Ser Pro Arg Ile Ile Glu His Pro Thr Asp
50 55 60

Leu Val Val Lys Lys Asn Glu Pro Ala Thr Leu Asn Cys Lys Val Glu
65 70 75 80

Gly Lys Pro Glu Pro Thr Ile Glu Trp Phe Lys Asp Gly Glu Pro Val
85 90 95

Ser Thr Asn Glu Lys Lys Ser His Arg Val Gln Phe Lys Asp Gly Ala
100 105 110

Leu Phe Phe Tyr Arg Thr Met Gln Gly Lys Lys Glu Gln Asp Gly Gly
115 120 125

Glu Tyr Trp Cys Val Ala Lys Asn Arg Val Gly Gln Ala Val Ser Arg
130 135 140

His Ala Ser Leu Gln Ile Ala Val Leu Arg Asp Asp Phe Arg Val Glu
145 150 155 160

Pro Lys Asp Thr Arg Val Ala Lys Gly Glu Thr Ala Leu Leu Glu Cys
165 170 175

Gly Pro Pro Lys Gly Ile Pro Glu Pro Thr Leu Ile Trp Ile Lys Asp
180 185 190

Gly Val Pro Leu Asp Asp Leu Lys Ala Met Ser Phe Gly Ala Ser Ser
195 200 205

Arg Val Arg Ile Val Asp Gly Gly Asn Leu Leu Ile Ser Asn Val Glu
210 215 220

Pro Ile Asp Glu Gly Asn Tyr Lys Cys Ile Ala Gln Asn Leu Val Gly
225 230 235 240

Thr Arg Glu Ser Ser Tyr Ala Lys Leu Ile Val Gln Val Lys Pro Tyr
245 250 255

Phe Met Lys Glu Pro Lys Asp Gln Val Met Leu Tyr Gly Gln Thr Ala
260 265 270

Thr Phe His Cys Ser Val Gly Gly Asp Pro Pro Pro Lys Val Leu Trp
275 280 285

Lys Lys Glu Glu Gly Asn Ile Pro Val Ser Arg Ala Arg Ile Leu His
290 295 300

Asp Glu Lys Ser Leu Glu Ile Ser Asn Ile Thr Pro Thr Asp Glu Gly
305 310 315 320

Thr Tyr Val Cys Glu Ala His Asn Asn Val Gly Gln Ile Ser Ala Arg
325 330 335


Ala Ser Leu Ile Val His Ala Pro Pro Asn Phe Thr Lys Arg Pro Ser
340 345 350

Asn Lys Lys Val Gly Leu Asn Gly Val Val Gln Leu Pro Cys Met Ala
355 360 365

Ser Gly Asn Pro Pro Pro Ser Val Phe Trp Thr Lys Glu Gly Val Ser
370 375 380

Thr Leu Met Phe Pro Asn Ser Ser His Gly Arg Gln Tyr Val Ala Ala
385 390 395 400

Asp Gly Thr Leu Gln Ile Thr Asp Val Arg Gln Glu Asp Glu Gly Tyr
405 410 415

Tyr Val Cys Ser Ala Phe Ser Val Val Asp Ser Ser Thr Val Arg Val
420 425 430

Phe Leu Gln Val Ser Ser Val Asp Glu Arg Pro Pro Pro Ile Ile Gln
435 440 445

Ile Gly Pro Ala Asn Gln Thr Leu Pro Lys Gly Ser Val Ala Thr Leu
450 455 460

Pro Cys Arg Ala Thr Gly Asn Pro Ser Pro Arg Ile Lys Trp Phe His
465 470 475 480

Asp Gly His Ala Val Gln Ala Gly Asn Arg Tyr Ser Ile Ile Gln Gly
485 490 495

Ser Ser Leu Arg Val Asp Asp Leu Gln Leu Ser Asp Ser Gly Thr Tyr
500 505 510

Thr Cys Thr Ala Ser Gly Glu Arg Gly Glu Thr Ser Trp Ala Ala Thr
515 520 525

Leu Thr Val Glu Lys Pro Gly Ser Thr Ser Leu His Arg Ala Ala Asp
530 535 540

Pro Ser Thr Tyr Pro Ala Pro Pro Gly Thr Pro Lys Val Leu Asn Val
545 550 555 560

Ser Arg Thr Ser Ile Ser Leu Arg Trp Ala Lys Ser Gln Glu Lys Pro
565 570 575

Gly Ala Val Gly Pro Ile Ile Gly Tyr Thr Val Glu Tyr Phe Ser Pro
580 585 590

Asp Leu Gln Thr Gly Trp Ile Val Ala Ala His Arg Val Gly Asp Thr
595 600 605

Gln Val Thr Ile Ser Gly Leu Thr Pro Gly Thr Ser Tyr Val Phe Leu
610 615 620

Val Arg Ala Glu Asn Thr Gln Gly Ile Ser Val Pro Ser Gly Leu Ser
625 630 635 640

Asn Val Ile Lys Thr Ile Glu Ala Asp Phe Asp Ala Ala Ser Ala Asn
645 650 655

Asp Leu Ser Ala Ala Arg Thr Leu Leu Thr Gly Lys Ser Val Glu Leu
660 665 670

Ile Asp Ala Ser Ala Ile Asn Ala Ser Ala Val Arg Leu Glu Trp Met
675 680 685

Leu His Val Ser Ala Asp Glu Lys Tyr Val Glu Gly Leu Arg Ile His
690 695 700

Tyr Lys Asp Ala Ser Val Pro Ser Ala Gln Tyr His Ser Ile Thr Val
705 710 715 720

Met Asp Ala Ser Ala Glu Ser Phe Val Val Gly Asn Leu Lys Lys Tyr
725 730 735

Thr Lys Tyr Glu Phe Phe Leu Thr Pro Phe Phe Glu Thr Ile Glu Gly
740 745 750

Gln Pro Ser Asn Ser Lys Thr Ala Leu Thr Tyr Glu Asp Val Pro Ser
755 760 765


Ala Pro Pro Asp Asn Ile Gln Ile Gly Met Tyr Asn Gln Thr Ala Gly
770 775 780

Trp Val Arg Trp Thr Pro Pro Pro Ser Gln His His Asn Gly Asn Leu
785 790 795 800

Tyr Gly Tyr Lys Ile Glu Val Ser Ala Gly Asn Thr Met Lys Val Leu
805 810 815

Ala Asn Met Thr Leu Asn Ala Thr Thr Thr Ser Val Leu Leu Asn Asn
820 825 830

Leu Thr Thr Gly Ala Val Tyr Ser Val Arg Leu Asn Ser Phe Thr Lys
835 840 845

Ala Gly Asp Gly Pro Tyr Ser Lys Pro Ile Ser Leu Phe Met Asp Pro
850 855 860

Thr His His Val His Pro Pro Arg Ala His Pro Ser Gly Thr His Asp
865 870 875 880

Gly Arg His Glu Gly Gln Asp Leu Thr Tyr His Asn Asn Gly Asn Ile
885 890 895

Pro Pro Gly Asp Ile Asn Pro Thr Thr His Lys Lys Thr Thr Asp Tyr
900 905 910

Leu Ser Gly Pro Trp Leu Met Val Leu Val Cys Ile Val Leu Leu Val
915 920 925

Leu Val Ile Ser Ala Ala Ile Ser Met Val Tyr Phe Lys Arg Lys His
930 935 940

Gln Met Thr Lys Glu Leu Gly His Leu Ser Val Val Ser Asp Asn Glu
945 950 955 960

Ile Thr Ala Leu Asn Ile Asn Ser Lys Glu Ser Leu Trp Ile Asp His
965 970 975

His Arg Gly Trp Arg Thr Ala Asp Thr Asp Lys Asp Ser Gly Leu Ser
980 985 990

Glu Ser Lys Leu Leu Ser His Val Asn Ser Ser Gln Ser Asn Tyr Asn
995 1000 1005

Asn Ser Asp Gly Gly Thr Asp Tyr Ala Glu Val Asp Thr Arg Asn Leu
1010 1015 1020

Thr Thr Phe Tyr Asn Cys Arg Lys Ser Pro Asp Asn Pro Thr Pro Tyr
1025 1030 1035 1040

Ala Thr Thr Met Ile Ile Gly Thr Ser Ser Ser Glu Thr Cys Thr Lys
1045 1050 1055

Thr Thr Ser Ile Ser Ala Asp Lys Asp Ser Gly Thr His Ser Pro Tyr
1060 1065 1070

Ser Asp Ala Phe Ala Gly Gln Val Pro Ala Val Pro Val Val Lys Ser
1075 1080 1085

Asn Tyr Leu Gln Tyr Pro Val Glu Pro Ile Asn Trp Ser Glu Phe Leu
1090 1095 1100

Pro Pro Pro Pro Glu His Pro Pro Pro Ser Ser Thr Tyr Gly Tyr Ala
1105 1110 1115 1120

Gln Gly Ser Pro Glu Ser Ser Arg Lys Ser Ser Lys Ser Ala Gly Ser
1125 1130 1135

Gly Ile Ser Thr Asn Gln Ser Ile Leu Asn Ala Ser Ile His Ser Ser
1140 1145 1150

Ser Ser Gly Gly Phe Ser Ala Trp Gly Val Ser Pro Gln Tyr Ala Val
1155 1160 1165

Ala Cys Pro Pro Glu Asn Val Tyr Ser Asn Pro Leu Ser Ala Val Ala
1170 1175 1180

Gly Gly Thr Gln Asn Arg Tyr Gln Ile Thr Pro Thr Asn Gln His Pro
1185 1190 1195 1200


Pro Gln Leu Pro Ala Tyr Phe Ala Thr Thr Gly Pro Gly Gly Ala Val
1205 1210 1215

Pro Pro Asn His Leu Pro Phe Ala Thr Gln Arg His Ala Ala Ser Glu
1220 1225 1230

Tyr Gln Ala Gly Leu Asn Ala Ala Arg Cys Ala Gln Ser Arg Ala Cys
1235 1240 1245

Asn Ser Cys Asp Ala Leu Ala Thr Pro Ser Pro Met Gln Pro Pro Pro
1250 1255 1260

Pro Val Pro Val Pro Glu Gly Trp Tyr Gln Pro Val His Pro Asn Ser
1265 1270 1275 1280

His Pro Met His Pro Thr Ser Ser Asn His Gln Ile Tyr Gln Cys Ser
1285 1290 1295

Ser Glu Cys Ser Asp His Ser Arg Ser Ser Gln Ser His Lys Arg Gln
1300 1305 1310

Leu Gln Leu Glu Glu His Gly Ser Ser Ala Lys Gln Arg Gly Gly His
1315 1320 1325

His Arg Arg Arg Ala Pro Val Val Gln Pro Cys Met Glu Ser Glu Asn
1330 1335 1340

Glu Asn Met Leu Ala Glu Tyr Glu Gln Arg Gln Tyr Thr Ser Asp Cys
1345 1350 1355 1360

Cys Asn Ser Ser Arg Glu Gly Asp Thr Cys Ser Cys Ser Glu Gly Ser
1365 1370 1375

Cys Leu Tyr Ala Glu Ala Gly Glu Pro Ala Pro Arg Gln Met Thr Ala
1380 1385 1390

Lys Asn Thr 
1395 
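
The 1395 residues of SEQ ID NO: 2 appear consistent with translation of the 4188-base cDNA of SEQ ID NO: 1: 4188 / 3 gives 1396 codons, the last of which (TAA) is a stop codon. The sketch below is an illustration rather than part of the listing. It assumes the standard genetic code, and the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

def translate(cdna):
    """Translate a coding strand in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(cdna) - 2, 3):
        amino = CODON_TABLE[cdna[i:i + 3]]
        if amino == "*":
            break
        residues.append(THREE_LETTER[amino])
    return " ".join(residues)

# First 48 bases and last 12 bases of SEQ ID NO: 1.
print(translate("ATGCATCCCATGCATCCCGAAAACCACGCCATCGCCCGGAGCACGAGC"))
print(translate("AAGAACACCTAA"))
```

Comparing the printed residues with the first row and the final three residues of SEQ ID NO: 2 gives a quick spot check; a full comparison would translate the complete cDNA.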

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GGTGAAAATC CACGCATCAT CGAGCATCCC ATGGACACGA CGGTGCCAAA AAATGATCCA 60

TTTACGTTTA ATTGCCAGGC CGAGGGCAAT CCAACACCAA CCATTCAATG GTTTAAGGAC 120

GGTCGCGAAC TGAAGACGGA TACGGGTTCG CATCGCATAA TGCTGCCCGC CGGGGGTCTA 180

TTCTTTCTCA AGGTTATCCA CTCACGTAGA GAGAGCGATG CGGGCACTTA CTGGTGCGAG 240

GCCAAAAACG AGTTTGGAGT GGCACGGTCC AGGAATGCAA CGTTGCAAGT GGCAGTTCTC 300

CGCGACGAAT TCCGTTTGGA GCCGGCAAAT ACCCGCGTGG CCCAAGGCGA GGTGGCCCTG 360 
ATGGAATGCG GTGCCCCCCG AGGATCTCCG GAGCCGCAAA TCTCGTGGCG CAAGAACGGC 420

CAGACCCTGA ATCTTGTCGG GAACAAGCGG ATTCGCATTG TCGACGGTGG CAATCTGGCC 480 
ATCCAGGAAG CCCGCCAATC GGACGACGGA CGCTACCAGT GTGTGGTCAA GAATGTGGTT 540 
GGCACCCGGG AGTCGGCCAC CGCTTTTCTT AAAGTGCATG TACGTCCATT CCTCATCCGA 600 
GGACCCCAGA ATCAGACGGC GGTGGTGGGC AGCTCGGTGG TCTTCCAGTG CCGCATCGGA 660 
GGCGATCCCC TGCCTGATGT CCTGTGGCGA CGCACTGCCT CCGGCGGCAA TATGCCACTG 720 
CGTAAGTTTT CTTGGCTTCA TTCAGCTTCA GGTCGTGTGC ACGTACTTGA GGACCGCAGT 780 
CTGAAGCTGG ACGACGTTAC TCTGGAGGAC ATGGGCGAGT ACACTTGCGA GGCGGACAAT 840 
GCGGTGGGCG GCATCACGGC CACTGGCATC CTCACCGTTC ACGCTCCCCC CAAATTTGTG 900 
ATACGCCCCA AGAATCAGCT GGTGGAGATC GGTGATGAAG TGCTGTTCGA GTGCCAAGCG 960 
AATGGACATC CCCGACCAAC GCTCTACTGG TCGGTGGAGG GCAACAGCTC CCTGCTGCTC 1020 
CCCGGCTATC GGGATGGCCG CATGGAAGTG ACCCTGACGC CCGAGGGGCG CTCGGTGCTC 1080 
TCGATAGCTC GATTTGCCCG TGAGGATTCC GGAAAGGTGG TCACTTGCAA CGCCCTGAAC 1140 



GCCGTGGGCA GCGTCAGCAG TCGGACTGTG GTCAGTGTGG ATACGCAATT CGAGCTGCCA 1200

CCGCCGATTA TCGAACAGGG GCCCGTGAAT CAAACGTTGC CCGTTAAATC AATTGTGGTT 1260 

CTGCCATGCC GAACTCTGGG CACTCCAGTG CCACAGGTCT CTTGGTACCT GGATGGCATA 1320 

CCCATCGATG TGCAGGAGCA CGAGCGGCGG AATCTTTCGG ACGCTGGAGC CTTAACCATT 1380 

TCGGATCTTC AGCGCCACGA GGATGAAGGC TTGTACACCT GCGTGGCCAG CAATCGCAAC 1440 

GGAAAATCCT CTTGGAGTGG TTACCTTCGT CTGGACACCC CGACAAATCC GAATATCAAG 1500 

TTCTTCAGAG CCCCAGAACT TTCCACCTAC CCAGGGCCGC CAGGAAAACC GCAAATGGTG 1560 

GAGAAGGGCG AAAATTCGGT GACTCTCAGC TGGACGAGGA GCAACAAGGT GGGCGGCTCC 1620

AGTCTGGTGG GCTATGTAAT CGAGATGTTT GGCAAAAACG AAACGGATGG CTGGGTGGCT 1680

GTGGGCACTA GGGTGCAAAA TACCACGTTT ACCCAAACGG GTCTGCTGCC GGGTGTGAAT 1740

TACTTCTTTC TAATTCGAGC CGAGAACTCC CATGGCTTAT CACTGCCCAG TCCGATGTCG 1800 

GAACCCATTA CGGTGGGAAC GCGCTACTTC AATAGTGGTC TGGATCTGAG CGAGGCTCGT 1860 

GCCAGTCTGC TGTCCGGAGA TGTTGTGGAG CTGAGCAACG CCAGTGTGGT GGACTCCACT 1920

AGCATGAAAC TCACCTGGCA GATCATCAAT GGCAAATACG TCGAGGGCTT CTATGTCTAT 1980 

GCGAGACAGT TGCCAAATCC AATAGTCAAC AATCCGGCGC CCGTTACTAG CAATACCAAT 2040

CCGCTGCTGG GCTCTACATC CACATCCGCA TCCGCATCCG CCTCGGCATC GGCATTGATT 2100 

TCGACAAAGC CAAATATTGC AGCTGCCGGC AAACGTGATG GGGAGACAAA CCAGAGTGGA 2160 

GGAGGAGCTC CGACCCCACT GAACACCAAG TATCGCATGC TAACGATTCT CAATGGCGGT 2220

GGCGCCTCAT CCTGCACCAT CACCGGGCTC GTCCAGTACA CGCTGTATGA ATTTTTCATC 2280 

GTGCCATTTT ACAAATCCGT CGAGGGCAAG CCGTCGAATT CGCGCATCGC TCGCACCCTT 2340

GAAGATGTTC CCTCTGAGGC ACCATATGGA ATGGAGGCTC TGCTGTTGAA CTCCTCCGCG 2400 

GTCTTCCTCA AATGGAAGGC ACCAGAACTC AAGGATCGGC ATGGTGTTCT CTTGAACTAT 2460 

CATGTTATAG TCCGAGGTAT TGACACTGCC CACAATTTCT CACGCATTTT GACAAATGTC 2520

ACCATCGATG CCGCTTCGCC TACTCTGGTT TTGGCCAATC TCACCGAAGG CGTCATGTAC 2580

ACCGTGGGCG TGGCGGCCGG AAATAACGCT GGAGTTGGTC CTTATTGTGT CCCAGCTACT 2640

TTGCGTTTGG ATCCCATCAC AAAGCGACTC GATCCGTTCA TCAATCAGCG GGACCATGTT 2700 

AACGATGTGC TGACGCAGCC CTGGTTCATA ATACTCCTGG GCGCCATCCT GGCCGTTCTT 2760

ATGCTGTCCT TTGGCGCAAT GGTCTTTGTG AAGCGCAAGC ACATGATGAT GAAGCAGTCG 2820

GCCCTAAATA CAATGCGTGG CAATCACACG AGCGACGTGC TCAAAATGCC GAGTCTATCG 2880

GCGCGCAATG GAAACGGCTA CTGGCTGGAC TCCTCCACCG GCGGAATGGT GTGGCGTCCC 2940

TCGCCCGGCG GCGACTCGCT GGAGATGCAA AAGGATCACA TCGCCGACTA TGCGCCGGTC 3000

TGCGGTGCCC CCGGTTCTCC GGCCGGCGGT GGCACCTCTT CCGGTGGATC CGGTGGCGCG 3060

GGCAGCGGTG CCAGCGGCGG CGATGACATT CATGGAGGAC ACGGCAGCGA ACGCAATCAG 3120

CAGCGGTACG TGGGCGAGTA CTCCAACATA CCGACCGACT ATGCAGAGGT GTCCAGTTTT 3180 

GGCAAGGCAC CCAGCGAGTA TGGTCGGCAT GGCAACGCCT CCCCGGCCCC TTATGCCACC 3240

TCTTCGATCC TGAGTCCCCA CCAGCAGCAA CAGCAGCAGC AGCCGCGTTA TCAACAGCGA 3300

CCAGTGCCCG GCTATGGGCT CCAGCGCCCA ATGCACCCAC ACTACCAGCA GCAGCAGCAT 3360

CAGCAGCAAC AGGCGCAGCA GACGCACCAG CAACACCAGG CTCTCCAGCA GCACCAGCAA 3420

CTGCCACCCA GCAACATCTA CCAGCAGATG TCCACCACCA GCGAGATATA CCCCACGAAC 3480 

ACGGGTCCTT CGCGCTCTGT CTACTCTGAG CAGTATTACT ACCCCAAGGA CAAGCAGAGA 3540 

CACATCCACA TCACCGAGAA CAAGCTGAGC AACTGCCACA CCTATGAGGC GGCTCCTGGC 3600 

GCCAAGCAGT CCTCGCCGAT ATCCTCGCAG TTCGCCAGCG TGAGGCGGCA GCAGCTGCCG 3660 

CCCAACTGCA GCATCGGCAG GGAAAGTGCC CGCTTCAAGG TGCTAAACAC GGATCAGGGC 3720

AAGAACCAGC AGAATCTCCT GGATCTCGAC GGCTCCTCGA TGTGCTACAA CGGTCTGGCA 3780

GACTCGGGCT GCGGTGGATC TCCCTCCCCG ATGGCCATGC TGATGTCGCA CGAGGACGAG 3840 

CACGCGCTGT ACCACACGGC GGATGGGGAT CTGGACGACA TGGAACGACT GTACGTCAAG 3900

GTGGACGAGC AGCAGCCTCC ACAGCAGCAG CAGCAGCTGA TTCCCCTGGT CCCACAGCAT 3960

CCGGCGGAAG GTCACCTGCA GTCCTGGCGG AATCAGAGCA CGCGGAGCAG TCGGAAGAAC 4020

GGCCAGGAAT GCATCAAGGA ACCCAGCGAG TTGATCTACG CTCCGGGAAG CGTGGCCAGC 4080

GAACGGAGCC TCCTCAGCAA CTCGGGTAGC GGCACCAGCA GCCAGCCAGC TGGCCACAAT 4140 

GTCTGA 4146 


(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 1381 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Glu Asn Pro Arg Ile Ile Glu His Pro Met Asp Thr Thr Val Pro
1 5 10 15

Lys Asn Asp Pro Phe Thr Phe Asn Cys Gln Ala Glu Gly Asn Pro Thr
20 25 30

Pro Thr Ile Gln Trp Phe Lys Asp Gly Arg Glu Leu Lys Thr Asp Thr
35 40 45

Gly Ser His Arg Ile Met Leu Pro Ala Gly Gly Leu Phe Phe Leu Lys
50 55 60

Val Ile His Ser Arg Arg Glu Ser Asp Ala Gly Thr Tyr Trp Cys Glu
65 70 75 80

Ala Lys Asn Glu Phe Gly Val Ala Arg Ser Arg Asn Ala Thr Leu Gln
85 90 95

Val Ala Val Leu Arg Asp Glu Phe Arg Leu Glu Pro Ala Asn Thr Arg
100 105 110

Val Ala Gln Gly Glu Val Ala Leu Met Glu Cys Gly Ala Pro Arg Gly
115 120 125

Ser Pro Glu Pro Gln Ile Ser Trp Arg Lys Asn Gly Gln Thr Leu Asn
130 135 140

Leu Val Gly Asn Lys Arg Ile Arg Ile Val Asp Gly Gly Asn Leu Ala
145 150 155 160

Ile Gln Glu Ala Arg Gln Ser Asp Asp Gly Arg Tyr Gln Cys Val Val
165 170 175

Lys Asn Val Val Gly Thr Arg Glu Ser Ala Thr Ala Phe Leu Lys Val
180 185 190

His Val Arg Pro Phe Leu Ile Arg Gly Pro Gln Asn Gln Thr Ala Val
195 200 205

Val Gly Ser Ser Val Val Phe Gln Cys Arg Ile Gly Gly Asp Pro Leu
210 215 220

Pro Asp Val Leu Trp Arg Arg Thr Ala Ser Gly Gly Asn Met Pro Leu
225 230 235 240

Arg Lys Phe Ser Trp Leu His Ser Ala Ser Gly Arg Val His Val Leu
245 250 255

Glu Asp Arg Ser Leu Lys Leu Asp Asp Val Thr Leu Glu Asp Met Gly
260 265 270

Glu Tyr Thr Cys Glu Ala Asp Asn Ala Val Gly Gly Ile Thr Ala Thr
275 280 285

Gly Ile Leu Thr Val His Ala Pro Pro Lys Phe Val Ile Arg Pro Lys
290 295 300

Asn Gln Leu Val Glu Ile Gly Asp Glu Val Leu Phe Glu Cys Gln Ala
305 310 315 320

Asn Gly His Pro Arg Pro Thr Leu Tyr Trp Ser Val Glu Gly Asn Ser
325 330 335

Ser Leu Leu Leu Pro Gly Tyr Arg Asp Gly Arg Met Glu Val Thr Leu
340 345 350

Thr Pro Glu Gly Arg Ser Val Leu Ser Ile Ala Arg Phe Ala Arg Glu
355 360 365

Asp Ser Gly Lys Val Val Thr Cys Asn Ala Leu Asn Ala Val Gly Ser 
370 375 380 


Val Ser Ser Arg Thr Val Val Ser Val Asp Thr Gln Phe Glu Leu Pro
385 390 395 400

Pro Pro Ile Ile Glu Gln Gly Pro Val Asn Gln Thr Leu Pro Val Lys
405 410 415

Ser Ile Val Val Leu Pro Cys Arg Thr Leu Gly Thr Pro Val Pro Gln
420 425 430

Val Ser Trp Tyr Leu Asp Gly Ile Pro Ile Asp Val Gln Glu His Glu
435 440 445

Arg Arg Asn Leu Ser Asp Ala Gly Ala Leu Thr Ile Ser Asp Leu Gln
450 455 460

Arg His Glu Asp Glu Gly Leu Tyr Thr Cys Val Ala Ser Asn Arg Asn
465 470 475 480

Gly Lys Ser Ser Trp Ser Gly Tyr Leu Arg Leu Asp Thr Pro Thr Asn
485 490 495

Pro Asn Ile Lys Phe Phe Arg Ala Pro Glu Leu Ser Thr Tyr Pro Gly
500 505 510

Pro Pro Gly Lys Pro Gln Met Val Glu Lys Gly Glu Asn Ser Val Thr
515 520 525

Leu Ser Trp Thr Arg Ser Asn Lys Val Gly Gly Ser Ser Leu Val Gly
530 535 540

Tyr Val Ile Glu Met Phe Gly Lys Asn Glu Thr Asp Gly Trp Val Ala
545 550 555 560

Val Gly Thr Arg Val Gln Asn Thr Thr Phe Thr Gln Thr Gly Leu Leu
565 570 575

Pro Gly Val Asn Tyr Phe Phe Leu Ile Arg Ala Glu Asn Ser His Gly
580 585 590

Leu Ser Leu Pro Ser Pro Met Ser Glu Pro Ile Thr Val Gly Thr Arg
595 600 605

Tyr Phe Asn Ser Gly Leu Asp Leu Ser Glu Ala Arg Ala Ser Leu Leu
610 615 620

Ser Gly Asp Val Val Glu Leu Ser Asn Ala Ser Val Val Asp Ser Thr
625 630 635 640

Ser Met Lys Leu Thr Trp Gln Ile Ile Asn Gly Lys Tyr Val Glu Gly
645 650 655

Phe Tyr Val Tyr Ala Arg Gln Leu Pro Asn Pro Ile Val Asn Asn Pro
660 665 670

Ala Pro Val Thr Ser Asn Thr Asn Pro Leu Leu Gly Ser Thr Ser Thr
675 680 685

Ser Ala Ser Ala Ser Ala Ser Ala Ser Ala Leu Ile Ser Thr Lys Pro
690 695 700

Asn Ile Ala Ala Ala Gly Lys Arg Asp Gly Glu Thr Asn Gln Ser Gly
705 710 715 720

Gly Gly Ala Pro Thr Pro Leu Asn Thr Lys Tyr Arg Met Leu Thr Ile
725 730 735

Leu Asn Gly Gly Gly Ala Ser Ser Cys Thr Ile Thr Gly Leu Val Gln
740 745 750

Tyr Thr Leu Tyr Glu Phe Phe Ile Val Pro Phe Tyr Lys Ser Val Glu
755 760 765

Gly Lys Pro Ser Asn Ser Arg Ile Ala Arg Thr Leu Glu Asp Val Pro
770 775 780

Ser Glu Ala Pro Tyr Gly Met Glu Ala Leu Leu Leu Asn Ser Ser Ala
785 790 795 800

Val Phe Leu Lys Trp Lys Ala Pro Glu Leu Lys Asp Arg His Gly Val
805 810 815



Leu Leu Asn Tyr His Val Ile Val Arg Gly Ile Asp Thr Ala His Asn
820 825 830

Phe Ser Arg Ile Leu Thr Asn Val Thr Ile Asp Ala Ala Ser Pro Thr
835 840 845

Leu Val Leu Ala Asn Leu Thr Glu Gly Val Met Tyr Thr Val Gly Val
850 855 860

Ala Ala Gly Asn Asn Ala Gly Val Gly Pro Tyr Cys Val Pro Ala Thr
865 870 875 880

Leu Arg Leu Asp Pro Ile Thr Lys Arg Leu Asp Pro Phe Ile Asn Gln
885 890 895

Arg Asp His Val Asn Asp Val Leu Thr Gln Pro Trp Phe Ile Ile Leu
900 905 910

Leu Gly Ala Ile Leu Ala Val Leu Met Leu Ser Phe Gly Ala Met Val
915 920 925

Phe Val Lys Arg Lys His Met Met Met Lys Gln Ser Ala Leu Asn Thr
930 935 940

Met Arg Gly Asn His Thr Ser Asp Val Leu Lys Met Pro Ser Leu Ser
945 950 955 960

Ala Arg Asn Gly Asn Gly Tyr Trp Leu Asp Ser Ser Thr Gly Gly Met
965 970 975

Val Trp Arg Pro Ser Pro Gly Gly Asp Ser Leu Glu Met Gln Lys Asp
980 985 990

His Ile Ala Asp Tyr Ala Pro Val Cys Gly Ala Pro Gly Ser Pro Ala
995 1000 1005

Gly Gly Gly Thr Ser Ser Gly Gly Ser Gly Gly Ala Gly Ser Gly Ala
1010 1015 1020

Ser Gly Gly Asp Asp Ile His Gly Gly His Gly Ser Glu Arg Asn Gln
1025 1030 1035 1040

Gln Arg Tyr Val Gly Glu Tyr Ser Asn Ile Pro Thr Asp Tyr Ala Glu
1045 1050 1055

Val Ser Ser Phe Gly Lys Ala Pro Ser Glu Tyr Gly Arg His Gly Asn
1060 1065 1070

Ala Ser Pro Ala Pro Tyr Ala Thr Ser Ser Ile Leu Ser Pro His Gln
1075 1080 1085

Gln Gln Gln Gln Gln Gln Pro Arg Tyr Gln Gln Arg Pro Val Pro Gly
1090 1095 1100

Tyr Gly Leu Gln Arg Pro Met His Pro His Tyr Gln Gln Gln Gln His
1105 1110 1115 1120

Gln Gln Gln Gln Ala Gln Gln Thr His Gln Gln His Gln Ala Leu Gln
1125 1130 1135

Gln His Gln Gln Leu Pro Pro Ser Asn Ile Tyr Gln Gln Met Ser Thr
1140 1145 1150

Thr Ser Glu Ile Tyr Pro Thr Asn Thr Gly Pro Ser Arg Ser Val Tyr
1155 1160 1165

Ser Glu Gln Tyr Tyr Tyr Pro Lys Asp Lys Gln Arg His Ile His Ile
1170 1175 1180

Thr Glu Asn Lys Leu Ser Asn Cys His Thr Tyr Glu Ala Ala Pro Gly
1185 1190 1195 1200

Ala Lys Gln Ser Ser Pro Ile Ser Ser Gln Phe Ala Ser Val Arg Arg
1205 1210 1215

Gln Gln Leu Pro Pro Asn Cys Ser Ile Gly Arg Glu Ser Ala Arg Phe
1220 1225 1230

Lys Val Leu Asn Thr Asp Gln Gly Lys Asn Gln Gln Asn Leu Leu Asp
1235 1240 1245


Leu Asp Gly Ser Ser Met Cys Tyr Asn Gly Leu Ala Asp Ser Gly Cys 

1250 1255 1260 

Gly Gly Ser Pro Ser Pro Met Ala Met Leu Met Ser His Glu Asp Glu 
1265 1270 1275 1280 

His Ala Leu Tyr His Thr Ala Asp Gly Asp Leu Asp Asp Met Glu Arg 

1285 1290 1295 

Leu Tyr Val Lys Val Asp Glu Gln Gln Pro Pro Gln Gln Gln Gln Gln

1300 1305 1310 

Leu Ile Pro Leu Val Pro Gln His Pro Ala Glu Gly His Leu Gln Ser

1315 1320 1325 

Trp Arg Asn Gln Ser Thr Arg Ser Ser Arg Lys Asn Gly Gln Glu Cys

1330 1335 1340 

Ile Lys Glu Pro Ser Glu Leu Ile Tyr Ala Pro Gly Ser Val Ala Ser
1345 1350 1355 1360 

Glu Arg Ser Leu Leu Ser Asn Ser Gly Ser Gly Thr Ser Ser Gln Pro

1365 1370 1375 

Ala Gly His Asn Val 
1380 


(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:


ATGTACTATC TAGGTTTTTA CCACACTCAC ACACACACAC ACACATACAT AAATTTTGAT 60

AAAATTCCTA ATGCCTCAAA TCTCGCTCCC GTGATAATCG AACATCCCAT CGATGTGGTG 120

GTATCTAGGG GATCGCCAGC AACCCTCAAC TGTGGTGCAA AGCCATCTAC CGCCAAAATC 180

ACATGGTACA AGGATGGACA GCCCGTAATC ACGAATAAGG AGCAAGTGAA CAGCCACCGG 240

ATTGTTCTCG ACACGGGATC CCTGTTTCTT CTGAAAGTGA ATAGTGGAAA AAACGGAAAA 300

GACAGCGATG CGGGAGCGTA CTATTGTGTG GCCAGCAACG AGCACGGAGA AGTGAAGTCG 360

AACGAAGGAT CGTTAAAATT GGCGATGCTT CGCGAAGACT TTCGAGTTCG GCCAAGAACA 420

GTTCAGGCTC TTGGTGGAGA GATGGCCGTT CTGGAATGCA GTCCGCCACG TGGATTCCCG 480

GAGCCGGTTG TGAGCTGGCG GAAAGACGAC AAAGAGCTCC GAATTCAAGA CATGCCACGA 540

TACACTCTAC ACTCTGACGG AAACCTCATC ATTGATCCGG TCGATCGAAG CGATTCTGGT 600

ACTTATCAGT GTGTTGCCAA CAACATGGTC GGAGAACGGG TGTCCAATCC CGCAAGATTG 660

AGTGTCTTTG AGAAACCAAA GTTTGAGCAA GAACCCAAGG ACATGACGGT CGACGTCGGA 720

GCCGCAGTGC TGTTTGATTG TCGTGTGACT GGAGATCCTC AACCACAAAT TACGTGGAAA 780

CGCAAAAATG AGCCGATGCC AGTTACACGT GCATACATTG CCAAGGATAA TCGGGGGTTG 840

AGAATCGAAA GAGTTCAACC ATCAGACGAA GGTGAATACG TTTGCTATGC ACGAAATCCA 900

GCGGGAACTC TTGAAGCATC TGCACATCTT CGTGTCCAGG CACCTCCATC CTTCCAGACA 960

AAACCAGCAG ACCAGTCAGT TCCAGCTGGA GGCACGGCAA CTTTTGAATG CACCTTGGTC 1020

GGTCAACCGA GTCCCGCCTA TTTTTGGAGC AAGGAAGGCC AACAGGATCT TCTTTTCCCA 1080

AGTTATGTGT CCGCTGATGG TAGAACGAAA GTTTCACCAA CTGGAACATT GACAATTGAG 1140

GAAGTTCGTC AAGTTGATGA GGGAGCTTAT GTGTGCGCTG GAATGAACTC GGCAGGAAGC 1200

TCGTTGAGCA AGGCAGCTTT GAAAGCAACA TTTGAAACCA AAGGCCGTGT CCAAAAAAAA 1260

AAGAGCAAAA TGGGCAAACA GAAACAAAAA AATGTTCAAT CAATTATCAA ATATTTAATT 1320

TCAGCCGTGA CCGGAAACAC ACCCGCCAAA CCACCACCAA CAATCGAGCA TGGTCATCAA 1380

AATCAGACCC TTATGGTTGG ATCATCAGCC ATCCTTCCAT GTCAGGCTAG CGGAAAACCA 1440

ACTCCAGGAA TATCATGGCT CAGGGATGGG CTACCTATTG ACATTACAGA TAGTCGTATC 1500

AGTCAACATT CAACGGGAAG TCTACATATT GCCGATTTAA AGAAACCTGA CACCGGAGTT 1560

TACACTTGCA TTGCGAAGAA CGAGGATGGA GAGTCAACAT GGTCGGCATC TCTGACTGTT 1620


GAAGATCACA CTAGCAATGC ACAATTTGTT CGGATGCCGG ATCCATCGAA CTTCCCGTCT 1680

TCTCCAACGC AACCCATTAT TGTCAATGTC ACTGATACCG AAGTAGAGCT CCACTGGAAT 1740

GCTCCCTCCA CATCTGGCGC AGGACCAATC ACTGGTTATA TCATTCAGTA CTACAGTCCA 1800 

GACCTCGGAC AGACGTGGTT TAACATTCCA GACTACGTGG CATCTACTGA ATATAGAATA 1860 

AAGGGTCTGA AACCATCTCA CTCGTATATG TTTGTGATTC GAGCAGAAAA TGAGAAAGGT 1920

ATTGGAACGC CGAGTGTGTC GTCGGCTCTC GTTACCACTA GCAAGCCAGC AGCTCAAGTT 1980 

GCGCTTTCTG ACAAGAACAA AATGGACATG GCCATCGCTG AGAAGAGACT CACTTCGGAA 2040

CAACTCATAA AACTCGAGGA AGTGAAGACT ATTAATTCTA CGGCCGTTCG TTTGTTCTGG 2100 

AAGAAGAGGA AACTTGAAGA GCTGATTGAT GGTTACTACA TCAAGTGGAG AGGGCCTCCA 2160 

AGAACCAATG ATAATCAATA CGTGAATGTG ACCAGCCCTA GCACCGAAAA CTATGTTGTT 2220

TCAAATTTAA TGCCATTCAC CAACTATGAG TTTTTCGTGA TTCCTTATCA TTCCGGAGTT 2280

CATAGTATTC ATGGAGCACC GAGTAATTCC ATGGACGTGT TGACCGCCGA AGCTCCACCT 2340

TCATTGCCAC CAGAGGATGT GCGAATCCGT ATGCTCAACC TGACCACTCT TCGTATCTCT 2400 

TGGAAAGCAC CAAAAGCCGA CGGCATCAAC GGAATTCTCA AAGGATTCCA AATTGTTATT 2460 

GTTGGTCAAG CGCCCAACAA CAATCGGAAC ATCACTACAA ACGAGAGAGC TGCCAGTGTT 2520 

ACTCTGTTCC ATTTAGTGAC TGGAATGACG TATAAAATTC GTGTAGCGGC TAGAAGCAAT 2580 

GGTGGAGTTG GAGTCTCACA TGGAACGAGT GAAGTCATCA TGAATCAAGA CACGCTGGAA 2640

AAACACCTTG CTGCTCAACA AGAAAACGAA TCATTTTTGT ATGGGCTGAT CAATAAATCT 2700 

CATGTTCCTG TGATTGTCAT TGTTGCAATT CTGATTATTT TCGTAGTCAT CATTATAGCC 2760

TATTGTTACT GGAGGAATAG CAGAAACAGT GATGGAAAGG ATCGAAGTTT TATAAAGATC 2820

AATGATGGAA GTGTTCATAT GGCTTCGAAT AATCTTTGGG ATGTTGCACA AAATCCGAAT 2880 

CAGAATCCAA TGTACAACAC TGCTGGAAGA ATGACTATGA ACAATAGAAA TGGCCAGGCT 2940

CTCTATTCGC TGACACCAAA TGCGCAAGAC TTTTTCAACA ATTGTGATGA CTACAGTGGA 3000

ACGATGCACA GACCAGGATC CGAGCATCAC TATCATTATG CTCAACTGAC TGGCGGACCT 3060

GGTAATGCGA TGTCTACTTT TTATGGAAAC CAATATCACG ATGATCCATC TCCATATGCC 3120 

ACCACAACAC TGGTCCTGTC GAACCAACAA CCAGCTTGGC TCAATGACAA AATGCTTCGC 3180 

GCGCCAGCAA TGCCAACAAA TCCCGTGCCA CCAGAGCCAC CGGCGCGATA TGCAGATCAT 3240

ACCGCTGGAA GACGATCTCG ATCGAGCCGT GCATCCGATG GGAGAGGAAC TCTGAATGGC 3300

GGACTCCATC ACCGGACTAG CGGAAGTCAA CGGTCGGATA GTCCACCTCA CACAGATGTG 3360

AGCTATGTTC AGCTTCACTC ATCCGATGGA ACTGGTAGTA GTAAGGAAAG AACTGGGGAG 3420

CGGAGAACAC CACCGAATAA GACTCTGATG GACTTTATTC CGCCACCACC TTCCAATCCA 3480

CCACCACCTG GAGGGCACGT TTATGACACA GCAACTAGGC GTCAGTTGAA TCGTGGAAGT 3540

ACTCCACGAG AAGACACCTA CGATTCGGTC AGTGACGGAG CTTTTGCTCG GGTTGATGTG 3600 

AATGCAAGGC CAACGAGTCG GAATCGGAAT TTGGGAGGAA GGCCGCTGAA AGGGAAACGA 3660 

GACGACGATA GTCAGCGGTC TTCGTTGATG ATGGACGATG ATGGTGGATC TTCTGAAGCT 3720

GACGGGGAGA ACTCTGAAGG AGACGTTCCG CGTGGAGGTG TTAGAAAAGC AGTTCCTCGA 3780

ATGGGTATCT CTGCAAGTAC GCTGGCTCAT AGTTGTTACG GGACAAACGG CACTGCTCAA 3840

CGATTCCGGT CAATTCCACG TAACAATGGA ATCGTCACAC AAGAACAAAC TTGA 3894


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Tyr Tyr Leu Gly Phe Tyr His Thr His Thr His Thr His Thr Tyr

1 5 10 15

Ile Asn Phe Asp Lys Ile Pro Asn Ala Ser Asn Leu Ala Pro Val Ile

20 25 30

Ile Glu His Pro Ile Asp Val Val Val Ser Arg Gly Ser Pro Ala Thr

35 40 45

Leu Asn Cys Gly Ala Lys Pro Ser Thr Ala Lys Ile Thr Trp Tyr Lys

50 55 60

Asp Gly Gln Pro Val Ile Thr Asn Lys Glu Gln Val Asn Ser His Arg
65 70 75 80

Ile Val Leu Asp Thr Gly Ser Leu Phe Leu Leu Lys Val Asn Ser Gly

85 90 95

Lys Asn Gly Lys Asp Ser Asp Ala Gly Ala Tyr Tyr Cys Val Ala Ser

100 105 110

Asn Glu His Gly Glu Val Lys Ser Asn Glu Gly Ser Leu Lys Leu Ala

115 120 125

Met Leu Arg Glu Asp Phe Arg Val Arg Pro Arg Thr Val Gln Ala Leu

130 135 140

Gly Gly Glu Met Ala Val Leu Glu Cys Ser Pro Pro Arg Gly Phe Pro
145 150 155 160

Glu Pro Val Val Ser Trp Arg Lys Asp Asp Lys Glu Leu Arg Ile Gln

165 170 175

Asp Met Pro Arg Tyr Thr Leu His Ser Asp Gly Asn Leu Ile Ile Asp

180 185 190

Pro Val Asp Arg Ser Asp Ser Gly Thr Tyr Gln Cys Val Ala Asn Asn

195 200 205

Met Val Gly Glu Arg Val Ser Asn Pro Ala Arg Leu Ser Val Phe Glu

210 215 220

Lys Pro Lys Phe Glu Gln Glu Pro Lys Asp Met Thr Val Asp Val Gly
225 230 235 240

Ala Ala Val Leu Phe Asp Cys Arg Val Thr Gly Asp Pro Gln Pro Gln

245 250 255

Ile Thr Trp Lys Arg Lys Asn Glu Pro Met Pro Val Thr Arg Ala Tyr

260 265 270

Ile Ala Lys Asp Asn Arg Gly Leu Arg Ile Glu Arg Val Gln Pro Ser

275 280 285

Asp Glu Gly Glu Tyr Val Cys Tyr Ala Arg Asn Pro Ala Gly Thr Leu

290 295 300

Glu Ala Ser Ala His Leu Arg Val Gln Ala Pro Pro Ser Phe Gln Thr
305 310 315 320

Lys Pro Ala Asp Gln Ser Val Pro Ala Gly Gly Thr Ala Thr Phe Glu

325 330 335

Cys Thr Leu Val Gly Gln Pro Ser Pro Ala Tyr Phe Trp Ser Lys Glu

340 345 350

Gly Gln Gln Asp Leu Leu Phe Pro Ser Tyr Val Ser Ala Asp Gly Arg

355 360 365

Thr Lys Val Ser Pro Thr Gly Thr Leu Thr Ile Glu Glu Val Arg Gln

370 375 380

Val Asp Glu Gly Ala Tyr Val Cys Ala Gly Met Asn Ser Ala Gly Ser
385 390 395 400

Ser Leu Ser Lys Ala Ala Leu Lys Ala Thr Phe Glu Thr Lys Gly Arg

405 410 415

Val Gln Lys Lys Lys Ser Lys Met Gly Lys Gln Lys Gln Lys Asn Val

420 425 430

Gln Ser Ile Ile Lys Tyr Leu Ile Ser Ala Val Thr Gly Asn Thr Pro

435 440 445

Ala Lys Pro Pro Pro Thr Ile Glu His Gly His Gln Asn Gln Thr Leu

450 455 460

Met Val Gly Ser Ser Ala Ile Leu Pro Cys Gln Ala Ser Gly Lys Pro
465 470 475 480

Thr Pro Gly Ile Ser Trp Leu Arg Asp Gly Leu Pro Ile Asp Ile Thr

485 490 495

Asp Ser Arg Ile Ser Gln His Ser Thr Gly Ser Leu His Ile Ala Asp

500 505 510

Leu Lys Lys Pro Asp Thr Gly Val Tyr Thr Cys Ile Ala Lys Asn Glu

515 520 525

Asp Gly Glu Ser Thr Trp Ser Ala Ser Leu Thr Val Glu Asp His Thr

530 535 540

Ser Asn Ala Gln Phe Val Arg Met Pro Asp Pro Ser Asn Phe Pro Ser
545 550 555 560

Ser Pro Thr Gln Pro Ile Ile Val Asn Val Thr Asp Thr Glu Val Glu

565 570 575

Leu His Trp Asn Ala Pro Ser Thr Ser Gly Ala Gly Pro Ile Thr Gly

580 585 590

Tyr Ile Ile Gln Tyr Tyr Ser Pro Asp Leu Gly Gln Thr Trp Phe Asn

595 600 605

Ile Pro Asp Tyr Val Ala Ser Thr Glu Tyr Arg Ile Lys Gly Leu Lys

610 615 620

Pro Ser His Ser Tyr Met Phe Val Ile Arg Ala Glu Asn Glu Lys Gly
625 630 635 640

Ile Gly Thr Pro Ser Val Ser Ser Ala Leu Val Thr Thr Ser Lys Pro

645 650 655

Ala Ala Gln Val Ala Leu Ser Asp Lys Asn Lys Met Asp Met Ala Ile

660 665 670

Ala Glu Lys Arg Leu Thr Ser Glu Gln Leu Ile Lys Leu Glu Glu Val

675 680 685

Lys Thr Ile Asn Ser Thr Ala Val Arg Leu Phe Trp Lys Lys Arg Lys

690 695 700

Leu Glu Glu Leu Ile Asp Gly Tyr Tyr Ile Lys Trp Arg Gly Pro Pro
705 710 715 720

Arg Thr Asn Asp Asn Gln Tyr Val Asn Val Thr Ser Pro Ser Thr Glu

725 730 735

Asn Tyr Val Val Ser Asn Leu Met Pro Phe Thr Asn Tyr Glu Phe Phe

740 745 750

Val Ile Pro Tyr His Ser Gly Val His Ser Ile His Gly Ala Pro Ser

755 760 765

Asn Ser Met Asp Val Leu Thr Ala Glu Ala Pro Pro Ser Leu Pro Pro

770 775 780

Glu Asp Val Arg Ile Arg Met Leu Asn Leu Thr Thr Leu Arg Ile Ser
785 790 795 800

Trp Lys Ala Pro Lys Ala Asp Gly Ile Asn Gly Ile Leu Lys Gly Phe

805 810 815

Gln Ile Val Ile Val Gly Gln Ala Pro Asn Asn Asn Arg Asn Ile Thr

820 825 830

Thr Asn Glu Arg Ala Ala Ser Val Thr Leu Phe His Leu Val Thr Gly

835 840 845

Met Thr Tyr Lys Ile Arg Val Ala Ala Arg Ser Asn Gly Gly Val Gly

850 855 860

Val Ser His Gly Thr Ser Glu Val Ile Met Asn Gln Asp Thr Leu Glu
865 870 875 880

Lys His Leu Ala Ala Gln Gln Glu Asn Glu Ser Phe Leu Tyr Gly Leu

885 890 895

Ile Asn Lys Ser His Val Pro Val Ile Val Ile Val Ala Ile Leu Ile

900 905 910

Ile Phe Val Val Ile Ile Ile Ala Tyr Cys Tyr Trp Arg Asn Ser Arg

915 920 925

Asn Ser Asp Gly Lys Asp Arg Ser Phe Ile Lys Ile Asn Asp Gly Ser

930 935 940

Val His Met Ala Ser Asn Asn Leu Trp Asp Val Ala Gln Asn Pro Asn
945 950 955 960

Gln Asn Pro Met Tyr Asn Thr Ala Gly Arg Met Thr Met Asn Asn Arg

965 970 975

Asn Gly Gln Ala Leu Tyr Ser Leu Thr Pro Asn Ala Gln Asp Phe Phe

980 985 990

Asn Asn Cys Asp Asp Tyr Ser Gly Thr Met His Arg Pro Gly Ser Glu

995 1000 1005

His His Tyr His Tyr Ala Gln Leu Thr Gly Gly Pro Gly Asn Ala Met

1010 1015 1020

Ser Thr Phe Tyr Gly Asn Gln Tyr His Asp Asp Pro Ser Pro Tyr Ala
1025 1030 1035 1040

Thr Thr Thr Leu Val Leu Ser Asn Gln Gln Pro Ala Trp Leu Asn Asp

1045 1050 1055

Lys Met Leu Arg Ala Pro Ala Met Pro Thr Asn Pro Val Pro Pro Glu

1060 1065 1070

Pro Pro Ala Arg Tyr Ala Asp His Thr Ala Gly Arg Arg Ser Arg Ser

1075 1080 1085

Ser Arg Ala Ser Asp Gly Arg Gly Thr Leu Asn Gly Gly Leu His His

1090 1095 1100

Arg Thr Ser Gly Ser Gln Arg Ser Asp Ser Pro Pro His Thr Asp Val
1105 1110 1115 1120

Ser Tyr Val Gln Leu His Ser Ser Asp Gly Thr Gly Ser Ser Lys Glu

1125 1130 1135

Arg Thr Gly Glu Arg Arg Thr Pro Pro Asn Lys Thr Leu Met Asp Phe

1140 1145 1150

Ile Pro Pro Pro Pro Ser Asn Pro Pro Pro Pro Gly Gly His Val Tyr

1155 1160 1165

Asp Thr Ala Thr Arg Arg Gln Leu Asn Arg Gly Ser Thr Pro Arg Glu

1170 1175 1180

Asp Thr Tyr Asp Ser Val Ser Asp Gly Ala Phe Ala Arg Val Asp Val
1185 1190 1195 1200

Asn Ala Arg Pro Thr Ser Arg Asn Arg Asn Leu Gly Gly Arg Pro Leu

1205 1210 1215

Lys Gly Lys Arg Asp Asp Asp Ser Gln Arg Ser Ser Leu Met Met Asp

1220 1225 1230

Asp Asp Gly Gly Ser Ser Glu Ala Asp Gly Glu Asn Ser Glu Gly Asp

1235 1240 1245

Val Pro Arg Gly Gly Val Arg Lys Ala Val Pro Arg Met Gly Ile Ser

1250 1255 1260

Ala Ser Thr Leu Ala His Ser Cys Tyr Gly Thr Asn Gly Thr Ala Gln
1265 1270 1275 1280

Arg Phe Arg Ser Ile Pro Arg Asn Asn Gly Ile Val Thr Gln Glu Gln

1285 1290 1295

Thr

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATGAAATGGA AACATGTTCC TTTTTTGGTC ATGATATCAC TCCTCAGCTT ATCCCCAAAT 60

CACCTGTTTC TGGCCCAGCT TATTCCAGAC CCTGAAGATG TAGAGAGGGG GAACGACCAC 120

GGGACGCCAA TCCCCACCTC TGATAACGAT GACAATTCGC TGGGCTATAC AGGCTCCCGT 180

CTTCGTCAGG AAGATTTTCC ACCTCGCATT GTTGAACACC CTTCAGACCT GATTGTCTCA 240

AAAGGAGAAC CTGCAACTTT GAACTGCAAA GCTGAAGGCC GCCCCACACC CACTATTGAA 300

TGGTACAAAG GGGGAGAGAG AGTGGAGACA GACAAAGATG ACCCTCGCTC ACACCGAATG 360

TTGCTGCCGA GTGGATCTTT ATTTTTCTTA CGTATAGTAC ATGGACGGAA AAGTAGACCT 420

GATGAAGGAG TCTATGTCTG TGTAGCAAGG AATTACCTTG GAGAGGCTGT GAGCCACAAT 480

GCATCGCTGG AAGTAGCCAT ACTTCGGGAT GACTTCAGAC AAAACCCTTC GGATGTCATG 540

GTTGCAGTAG GAGAGCCTGC AGTAATGGAA TGCCAACCTC CACGAGGCCA TCCTGAGCCC 600

ACCATTTCAT GGAAGAAAGA TGGCTCTCCA CTGGATGATA AAGATGAAAG AATAACTATA 660

CGAGGAGGAA AGCTCATGAT CACTTACACC CGTAAAAGTG ACGCTGGCAA ATATGTTTGT 720

GTTGGTACCA ATATGGTTGG GGAACGTGAG AGTGAAGTAG CCGAGCTGAC TGTCTTAGAG 780

AGACCATCAT TTGTGAAGAG ACCCAGTAAC TTGGCAGTAA CTGTGGATGA CAGTGCAGAA 840

TTTAAATGTG AGGCCCGAGG TGACCCTGTA CCTACAGTAC GATGGAGGAA AGATGATGGA 900

GAGCTGCCCA AATCCAGATA TGAAATCCGA GATGATCATA CCTTGAAAAT TAGGAAGGTG 960

ACAGCTGGTG ACATGGGTTC ATACACTTGT GTTGCAGAAA ATATGGTGGG CAAAGCTGAA 1020

GCATCTGCTA CTCTGACTGT TCAAGAACCT CCACATTTTG TTGTGAAACC CCGTGACCAG 1080

GTTGTTGCTT TGGGACGGAC TGTAACTTTT CAGTGTGAAG CAACCGGAAA TCCTCAACCA 1140

GCTATTTTCT GGAGGAGAGA AGGGAGTCAG AATCTACTTT TCTCATATCA ACCACCACAG 1200

TCATCCAGCC GATTTTCAGT CTCCCAGACT GGCGACCTCA CAATTACTAA TGTCCAGCGA 1260

TCTGATGTTG GTTATTACAT CTGCCAGACT TTAAATGTTG CTGGAAGCAT CATCACAAAG 1320

GCATATTTGG AAGTTACAGA TGTGATTGCA GATCGGCCTC CCCCAGTTAT TCGACAAGGT 1380

CCTGTGAATC AGACTGTAGC CGTGGATGGC ACTTTCGTCC TCAGCTGTGT GGCCACAGGC 1440

AGTCCAGTGC CCACCATTCT GTGGAGAAAG GATGGAGTCC TCGTTTCAAC CCAAGACTCT 1500

CGAATCAAAC AGTTGGAGAA TGGAGTACTG CAGATCCGAT ATGCTAAGCT GGGTGATACT 1560

GGTCGGTACA CCTGCATTGC ATCAACCCCC AGTGGTGAAG CAACATGGAG TGCTTACATT 1620

GAAGTTCAAG AATTTGGAGT TCCAGTTCAG CCTCCAAGAC CTACTGACCC AAATTTAATC 1680

CCTAGTGCCC CATCAAAACC TGAAGTGACA GATGTCAGCA GAAATACAGT CACATTATCG 1740

TGGCAACCAA ATTTGAATTC AGGAGCAACT CCAACATCTT ATATTATAGA AGCCTTCAGC 1800

CATGCATCTG GTAGCAGCTG GCAGACCGTA GCAGAGAATG TGAAAACAGA AACATCTGCC 1860

ATTAAAGGAC TCAAACCTAA TGCAATTTAC CTTTTCCTTG TGAGGGCAGC TAATGCATAT 1920

GGAATTAGTG ATCCAAGCCA AATATCAGAT CCAGTGAAAA CACAAGATGT CCTACCAACA 1980

AGTCAGGGGG TGGACCACAA GCAGGTCCAG AGAGAGCTGG GAAATGCTGT TCTGCACCTC 2040

CACAACCCCA CCGTCCTTTC TTCCTCTTCC ATCGAAGTGC ACTGGACAGT AGATCAACAG 2100

TCTCAGTATA TACAAGGATA TAAAATTCTC TATCGGCCAT CTGGAGCCAA CCACGGAGAA 2160

TCAGACTGGT TAGTTTTTGA AGTGAGGACG CCAGCCAAAA ACAGTGTGGT AATCCCTGAT 2220

CTCAGAAAGG GAGTCAACTA TGAAATTAAG GCTCGCCCTT TTTTTAATGA ATTTCAAGGA 2280

GCAGATAGTG AAATCAAGTT TGCCAAAACC CTGGAAGAAG CACCCAGTGC CCCACCCCAA 2340

GGTGTAACTG TATCCAAGAA TGATGGAAAC GGAACTGCAA TTCTAGTTAG TTGGCAGCCA 2400

CCTCCAGAAG ACACTCAAAA TGGAATGGTC CAAGAGTATA AGGTTTGGTG TCTGGGCAAT 2460

GAAACTCGAT ACCACATCAA CAAAACAGTG GATGGTTCCA CCTTTTCCGT GGTCATTCCC 2520

TTTCTTGTTC CTGGAATCCG ATACAGTGTG GAAGTGGCAG CCAGCACTGG GGCTGGGTCT 2580

GGGGTAAAGA GTGAGCCTCA GTTCATCCAG CTGGATGCCC ATGGAAACCC TGTGTCACCT 2640

GAGGACCAAG TCAGCCTCGC TCAGCAGATT TCAGATGTGG TGAAGCAGCC GGCCTTCATA 2700

GCAGGTATTG GAGCAGCCTG TTGGATCATC CTCATGGTCT TCAGCATCTG GCTTTATCGA 2760

CACCGCAAGA AGAGAAACGG ACTTACTAGT ACCTACGCGG GTATCAGAAA AGTCCCGTCT 2820

TTTACCTTCA CACCAACAGT AACTTACCAG AGAGGAGGCG AAGCTGTCAG CAGTGGAGGG 2880

AGGCCTGGAC TTCTCAACAT CAGTGAACCT GCCGCGCAGC CATGGCTGGC AGACACGTGG 2940

CCTAATACTG GCAACAACCA CAATGACTGC TCCATCAGCT GCTGCACGGC AGGCAATGGA 3000

AACAGCGACA GCAACCTCAC TACCTACAGT CGCCCAGCTG ATTGTATAGC AAATTATAAC 3060

AACCAACTGG ATAACAAACA AACAAATCTG ATGCTCCCTG AGTCAACTGT TTATGGTGAT 3120

GTGGACCTTA GTAACAAAAT CAATGAGATG AAAACCTTCA ATAGCCCAAA TCTGAAGGAT 3180 

GGGCGTTTTG TCAATCCATC AGGGCAGCCT ACTCCTTACG CCACCACTCA GCTCATCCAG 3240 

TCAAACCTCA GCAACAACAT GAACAATGGC AGCGGGGACT CTGGCGAGAA GCACTGGAAA 3300

CCACTGGGAC AGCAGAAACA AGAAGTGGCA CCAGTTCAGT ACAACATCGT GGAGCAAAAC 3360

AAGCTGAACA AAGATTATCG AGCAAATGAC ACAGTTCCTC CAACTATCCC ATACAACCAA 3420

TCATACGACC AGAACACAGG AGGATCCTAC AACAGCTCAG ACCGGGGCAG TAGTACATCT 3480 

GGGAGTCAGG GGCACAAGAA AGGGGCAAGA ACACCCAAGG TACCAAAACA GGGTGGCATG 3540

AACTGGGCAG ACCTGCTTCC TCCTCCCCCA GCACATCCTC CTCCACACAG CAATAGCGAA 3600

GAGTACAACA TTTCTGTAGA TGAAAGCTAT GACCAAGAAA TGCCATGTCC CGTGCCACCA 3660

GCAAGGATGT ATTTGCAACA AGATGAATTA GAAGAGGAGG AAGATGAACG AGGCCCCACT 3720

CCCCCTGTTC GGGGAGCAGC TTCTTCTCCA GCTGCCGTGT CCTATAGCCA TCAGTCCACT 3780

GCCACTCTGA CTCCCTCCCC ACAGGAAGAA CTCCAGCCCA TGTTACAGGA TTGTCCAGAG 3840

GAGACTGGCC ACATGCAGCA CCAGCCCGAC AGGAGACGGC AGCCTGTGAG TCCTCCTCCA 3900

CCACCACGGC CGATCTCCCC TCCACATACC TATGGCTACA TTTCAGGACC CCTGGTCTCA 3960

GATATGGATA CGGATGCGCC AGAAGAGGAA GAAGACGAAG CCGACATGGA GGTAGCCAAG 4020 

ATGCAAACCA GAAGGCTTTT GTTACGTGGG CTTGAGCAGA CACCTGCCTC CAGTGTTGGG 4080 

GACCTGGAGA GCTCTGTCAC GGGGTCCATG ATCAACGGCT GGGGCTCAGC CTCAGAGGAG 4140 

GACAACATTT CCAGCGGACG CTCCAGTGTT AGTTCTTCGG ACGGCTCCTT TTTCACTGAT 4200

GCTGACTTTG CCCAGGCAGT CGCAGCAGCG GCAGAGTATG CTGGTCTGAA AGTAGCACGA 4260

CGGCAAATGC AGGATGCTGC TGGCCGTCGA CATTTTCATG CGTCTCAGTG CCCTAGGCCC 4320

ACAAGTCCCG TGTCTACAGA CAGCAACATG AGTGCCGCCG TAATGCAGAA AACCAGACCA 4380

GCCAAGAAAC TGAAACACCA GCCAGGACAT CTGCGCAGAG AAACCTACAC AGATGATCTT 4440 

CCACCACCTC CTGTGCCGCC ACCTGCTATA AAGTCACCTA CTGCCCAATC CAAGACACAG 4500 

CTGGAAGTAC GACCTGTAGT GGTGCCAAAA CTCCCTTCTA TGGATGCAAG AACAGACAGA 4560 

TCATCAGACA GAAAAGGAAG CAGTTACAAG GGGAGAGAAG TGTTGGATGG AAGACAGGTT 4620

GTTGACATGC GAACAAATCC AGGTGATCCC AGAGAAGCAC AGGAACAGCA AAATGACGGG 4680 

AAAGGACGTG GAAACAAGGC AGCAAAACGA GACCTTCCAC CAGCAAAGAC TCATCTCATC 4740 

CAAGAGGATA TTCTACCTTA TTGTAGACCT ACTTTTCCAA CATCAAATAA TCCCAGAGAT 4800 

CCCAGTTCCT CAAGCTCAAT GTCATCAAGA GGATCAGGAA GCAGACAAAG AGAACAAGCA 4860 

AATGTAGGTC GAAGAAATAT TGCAGAAATG CAGGTACTTG GAGGATATGA AAGAGGAGAA 4920

GATAATAATG AAGAATTAGA GGAAACTGAA AGCTGA 4956

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1651 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Lys Trp Lys His Val Pro Phe Leu Val Met Ile Ser Leu Leu Ser

1 5 10 15

Leu Ser Pro Asn His Leu Phe Leu Ala Gln Leu Ile Pro Asp Pro Glu

20 25 30

Asp Val Glu Arg Gly Asn Asp His Gly Thr Pro Ile Pro Thr Ser Asp

35 40 45

Asn Asp Asp Asn Ser Leu Gly Tyr Thr Gly Ser Arg Leu Arg Gln Glu

50 55 60

Asp Phe Pro Pro Arg Ile Val Glu His Pro Ser Asp Leu Ile Val Ser
65 70 75 80

Lys Gly Glu Pro Ala Thr Leu Asn Cys Lys Ala Glu Gly Arg Pro Thr

85 90 95

Pro Thr Ile Glu Trp Tyr Lys Gly Gly Glu Arg Val Glu Thr Asp Lys

100 105 110

Asp Asp Pro Arg Ser His Arg Met Leu Leu Pro Ser Gly Ser Leu Phe

115 120 125

Phe Leu Arg Ile Val His Gly Arg Lys Ser Arg Pro Asp Glu Gly Val

130 135 140

Tyr Val Cys Val Ala Arg Asn Tyr Leu Gly Glu Ala Val Ser His Asn
145 150 155 160

Ala Ser Leu Glu Val Ala Ile Leu Arg Asp Asp Phe Arg Gln Asn Pro

165 170 175

Ser Asp Val Met Val Ala Val Gly Glu Pro Ala Val Met Glu Cys Gln

180 185 190

Pro Pro Arg Gly His Pro Glu Pro Thr Ile Ser Trp Lys Lys Asp Gly

195 200 205

Ser Pro Leu Asp Asp Lys Asp Glu Arg Ile Thr Ile Arg Gly Gly Lys

210 215 220

Leu Met Ile Thr Tyr Thr Arg Lys Ser Asp Ala Gly Lys Tyr Val Cys
225 230 235 240

Val Gly Thr Asn Met Val Gly Glu Arg Glu Ser Glu Val Ala Glu Leu 

245 250 255 

Thr Val Leu Glu Arg Pro Ser Phe Val Lys Arg Pro Ser Asn Leu Ala 

260 265 270 

Val Thr Val Asp Asp Ser Ala Glu Phe Lys Cys Glu Ala Arg Gly Asp 

275 280 285 

Pro Val Pro Thr Val Arg Trp Arg Lys Asp Asp Gly Glu Leu Pro Lys 

290 295 300 

Ser Arg Tyr Glu Ile Arg Asp Asp His Thr Leu Lys Ile Arg Lys Val
305 310 315 320

Thr Ala Gly Asp Met Gly Ser Tyr Thr Cys Val Ala Glu Asn Met Val

325 330 335

Gly Lys Ala Glu Ala Ser Ala Thr Leu Thr Val Gln Glu Pro Pro His

340 345 350

Phe Val Val Lys Pro Arg Asp Gln Val Val Ala Leu Gly Arg Thr Val

355 360 365

Thr Phe Gln Cys Glu Ala Thr Gly Asn Pro Gln Pro Ala Ile Phe Trp

370 375 380

Arg Arg Glu Gly Ser Gln Asn Leu Leu Phe Ser Tyr Gln Pro Pro Gln
385 390 395 400

Ser Ser Ser Arg Phe Ser Val Ser Gln Thr Gly Asp Leu Thr Ile Thr

405 410 415

Asn Val Gln Arg Ser Asp Val Gly Tyr Tyr Ile Cys Gln Thr Leu Asn

420 425 430

Val Ala Gly Ser Ile Ile Thr Lys Ala Tyr Leu Glu Val Thr Asp Val

435 440 445

Ile Ala Asp Arg Pro Pro Pro Val Ile Arg Gln Gly Pro Val Asn Gln

450 455 460

Thr Val Ala Val Asp Gly Thr Phe Val Leu Ser Cys Val Ala Thr Gly
465 470 475 480

Ser Pro Val Pro Thr Ile Leu Trp Arg Lys Asp Gly Val Leu Val Ser

485 490 495

Thr Gln Asp Ser Arg Ile Lys Gln Leu Glu Asn Gly Val Leu Gln Ile

500 505 510

Arg Tyr Ala Lys Leu Gly Asp Thr Gly Arg Tyr Thr Cys Ile Ala Ser

515 520 525

Thr Pro Ser Gly Glu Ala Thr Trp Ser Ala Tyr Ile Glu Val Gln Glu

530 535 540

Phe Gly Val Pro Val Gln Pro Pro Arg Pro Thr Asp Pro Asn Leu Ile
545 550 555 560

Pro Ser Ala Pro Ser Lys Pro Glu Val Thr Asp Val Ser Arg Asn Thr 

565 570 575 

Val Thr Leu Ser Trp Gln Pro Asn Leu Asn Ser Gly Ala Thr Pro Thr

580 585 590

Ser Tyr Ile Ile Glu Ala Phe Ser His Ala Ser Gly Ser Ser Trp Gln

595 600 605

Thr Val Ala Glu Asn Val Lys Thr Glu Thr Ser Ala Ile Lys Gly Leu

610 615 620

Lys Pro Asn Ala Ile Tyr Leu Phe Leu Val Arg Ala Ala Asn Ala Tyr
625 630 635 640

Gly Ile Ser Asp Pro Ser Gln Ile Ser Asp Pro Val Lys Thr Gln Asp

645 650 655

Val Leu Pro Thr Ser Gln Gly Val Asp His Lys Gln Val Gln Arg Glu

660 665 670

Leu Gly Asn Ala Val Leu His Leu His Asn Pro Thr Val Leu Ser Ser

675 680 685

Ser Ser Ile Glu Val His Trp Thr Val Asp Gln Gln Ser Gln Tyr Ile

690 695 700

Gln Gly Tyr Lys Ile Leu Tyr Arg Pro Ser Gly Ala Asn His Gly Glu
705 710 715 720

Ser Asp Trp Leu Val Phe Glu Val Arg Thr Pro Ala Lys Asn Ser Val

725 730 735

Val Ile Pro Asp Leu Arg Lys Gly Val Asn Tyr Glu Ile Lys Ala Arg

740 745 750

Pro Phe Phe Asn Glu Phe Gln Gly Ala Asp Ser Glu Ile Lys Phe Ala

755 760 765

Lys Thr Leu Glu Glu Ala Pro Ser Ala Pro Pro Gln Gly Val Thr Val

770 775 780

Ser Lys Asn Asp Gly Asn Gly Thr Ala Ile Leu Val Ser Trp Gln Pro
785 790 795 800

Pro Pro Glu Asp Thr Gln Asn Gly Met Val Gln Glu Tyr Lys Val Trp

805 810 815

Cys Leu Gly Asn Glu Thr Arg Tyr His Ile Asn Lys Thr Val Asp Gly

820 825 830

Ser Thr Phe Ser Val Val Ile Pro Phe Leu Val Pro Gly Ile Arg Tyr

835 840 845

Ser Val Glu Val Ala Ala Ser Thr Gly Ala Gly Ser Gly Val Lys Ser

850 855 860

Glu Pro Gln Phe Ile Gln Leu Asp Ala His Gly Asn Pro Val Ser Pro
865 870 875 880

Glu Asp Gln Val Ser Leu Ala Gln Gln Ile Ser Asp Val Val Lys Gln

885 890 895

Pro Ala Phe Ile Ala Gly Ile Gly Ala Ala Cys Trp Ile Ile Leu Met

900 905 910

Val Phe Ser Ile Trp Leu Tyr Arg His Arg Lys Lys Arg Asn Gly Leu

915 920 925

Thr Ser Thr Tyr Ala Gly Ile Arg Lys Val Pro Ser Phe Thr Phe Thr

930 935 940

Pro Thr Val Thr Tyr Gln Arg Gly Gly Glu Ala Val Ser Ser Gly Gly
945 950 955 960

Arg Pro Gly Leu Leu Asn Ile Ser Glu Pro Ala Ala Gln Pro Trp Leu

965 970 975

Ala Asp Thr Trp Pro Asn Thr Gly Asn Asn His Asn Asp Cys Ser Ile

980 985 990

Ser Cys Cys Thr Ala Gly Asn Gly Asn Ser Asp Ser Asn Leu Thr Thr

995 1000 1005

Tyr Ser Arg Pro Ala Asp Cys Ile Ala Asn Tyr Asn Asn Gln Leu Asp

1010 1015 1020

Asn Lys Gln Thr Asn Leu Met Leu Pro Glu Ser Thr Val Tyr Gly Asp
1025 1030 1035 1040

Val Asp Leu Ser Asn Lys Ile Asn Glu Met Lys Thr Phe Asn Ser Pro

1045 1050 1055

Asn Leu Lys Asp Gly Arg Phe Val Asn Pro Ser Gly Gln Pro Thr Pro

1060 1065 1070

Tyr Ala Thr Thr Gln Leu Ile Gln Ser Asn Leu Ser Asn Asn Met Asn

1075 1080 1085

Asn Gly Ser Gly Asp Ser Gly Glu Lys His Trp Lys Pro Leu Gly Gln

1090 1095 1100

Gln Lys Gln Glu Val Ala Pro Val Gln Tyr Asn Ile Val Glu Gln Asn
1105 1110 1115 1120

Lys Leu Asn Lys Asp Tyr Arg Ala Asn Asp Thr Val Pro Pro Thr Ile

1125 1130 1135

Pro Tyr Asn Gln Ser Tyr Asp Gln Asn Thr Gly Gly Ser Tyr Asn Ser

1140 1145 1150

Ser Asp Arg Gly Ser Ser Thr Ser Gly Ser Gln Gly His Lys Lys Gly

1155 1160 1165

Ala Arg Thr Pro Lys Val Pro Lys Gln Gly Gly Met Asn Trp Ala Asp

1170 1175 1180

Leu Leu Pro Pro Pro Pro Ala His Pro Pro Pro His Ser Asn Ser Glu
1185 1190 1195 1200

Glu Tyr Asn Ile Ser Val Asp Glu Ser Tyr Asp Gln Glu Met Pro Cys

1205 1210 1215

Pro Val Pro Pro Ala Arg Met Tyr Leu Gln Gln Asp Glu Leu Glu Glu

1220 1225 1230

Glu Glu Asp Glu Arg Gly Pro Thr Pro Pro Val Arg Gly Ala Ala Ser

1235 1240 1245

Ser Pro Ala Ala Val Ser Tyr Ser His Gln Ser Thr Ala Thr Leu Thr

1250 1255 1260

Pro Ser Pro Gln Glu Glu Leu Gln Pro Met Leu Gln Asp Cys Pro Glu
1265 1270 1275 1280

Glu Thr Gly His Met Gln His Gln Pro Asp Arg Arg Arg Gln Pro Val

1285 1290 1295

Ser Pro Pro Pro Pro Pro Arg Pro Ile Ser Pro Pro His Thr Tyr Gly

1300 1305 1310

Tyr Ile Ser Gly Pro Leu Val Ser Asp Met Asp Thr Asp Ala Pro Glu

1315 1320 1325

Glu Glu Glu Asp Glu Ala Asp Met Glu Val Ala Lys Met Gln Thr Arg

1330 1335 1340

Arg Leu Leu Leu Arg Gly Leu Glu Gln Thr Pro Ala Ser Ser Val Gly
1345 1350 1355 1360

Asp Leu Glu Ser Ser Val Thr Gly Ser Met Ile Asn Gly Trp Gly Ser

1365 1370 1375

Ala Ser Glu Glu Asp Asn Ile Ser Ser Gly Arg Ser Ser Val Ser Ser

1380 1385 1390

Ser Asp Gly Ser Phe Phe Thr Asp Ala Asp Phe Ala Gln Ala Val Ala

1395 1400 1405

Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala Arg Arg Gln Met Gln

1410 1415 1420

Asp Ala Ala Gly Arg Arg His Phe His Ala Ser Gln Cys Pro Arg Pro
1425 1430 1435 1440

Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser Ala Ala Val Met Gln

1445 1450 1455

Lys Thr Arg Pro Ala Lys Lys Leu Lys His Gln Pro Gly His Leu Arg

1460 1465 1470

Arg Glu Thr Tyr Thr Asp Asp Leu Pro Pro Pro Pro Val Pro Pro Pro

1475 1480 1485

Ala Ile Lys Ser Pro Thr Ala Gln Ser Lys Thr Gln Leu Glu Val Arg

1490 1495 1500

Pro Val Val Val Pro Lys Leu Pro Ser Met Asp Ala Arg Thr Asp Arg
1505 1510 1515 1520

Ser Ser Asp Arg Lys Gly Ser Ser Tyr Lys Gly Arg Glu Val Leu Asp

1525 1530 1535

Gly Arg Gln Val Val Asp Met Arg Thr Asn Pro Gly Asp Pro Arg Glu

1540 1545 1550

Ala Gln Glu Gln Gln Asn Asp Gly Lys Gly Arg Gly Asn Lys Ala Ala

1555 1560 1565

Lys Arg Asp Leu Pro Pro Ala Lys Thr His Leu Ile Gln Glu Asp Ile

1570 1575 1580

Leu Pro Tyr Cys Arg Pro Thr Phe Pro Thr Ser Asn Asn Pro Arg Asp
1585 1590 1595 1600

Pro Ser Ser Ser Ser Ser Met Ser Ser Arg Gly Ser Gly Ser Arg Gln

1605 1610 1615

Arg Glu Gln Ala Asn Val Gly Arg Arg Asn Ile Ala Glu Met Gln Val

1620 1625 1630

Leu Gly Gly Tyr Glu Arg Gly Glu Asp Asn Asn Glu Glu Leu Glu Glu

1635 1640 1645

Thr Glu Ser
1650

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1300 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 855..1187

(D) OTHER INFORMATION: /note= "N signifies gap in sequence"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CAGATTGTTG CTCAAGGTCG AACAGTGACA TTTCCCTGTG AAACTAAAGG AAACCCACAG 60

CCAGCTGTTT TTTGGCAGAA AGAAGGCAGC CAGAACCTAC TTTTCCCAAA CCAACCCCAG 120

CAGCCCAACA GTAGATGCTC AGTGTCACCA ACTGGAGACC TCACAATCAC CAACATTCAA 180

CGTTCCGACG CGGGTTACTA CATCTGCCAG GCTTTAACTG TGGCAGGAAG CATTTTAGCA 240

AAAGCTCAAC TGGAGGTTAC TGATGTTTTG ACAGATAGAC CTCCACCTAT AATTCTACAA 300

GGCCCAGCCA ACCAAACGCT GGCAGTGGAT GGTACAGCGT TACTGAAATG TAAAGCCACT 360

GGTGATCCTC TTCCTGTAAT TAGCTGGTTA AAGGAGGGAT TTACTTTTCC GGGTAGAGAT 420

CCAAGAGCAA CAATTCAAGA GCAAGGCACA CTGCAGATTA AGAATTTACG GATTTCTGAT 480

ACTGGCACTT ATACTTGTGT GGCTACAAGT TCAAGTGGAG AGGCTTCCTG GAGTGCAGTG 540

CTGGATGTGA CAGAGTCTGG AGCAACAATC AGTAAAAACT ATGATTTAAG TGACCTGCCA 600

GGGCCACCAT CCAAACCGCA AGTCACTGAT GTTACTAAGA ACAGTGTCAC CTTGTCCTGG 660

CAGCCAGGTA CCCCTGGAAC CCTTCCAGCA AGTGCATATA TCATTGAGGC TTTCAGCCAA 720

TCAGTGAGCA ACAGCTGGCA GACCGTGGCA AACCATGTAA AGACCACCCT CTATACTGTA 780 

AGAGGACTGC GGCCCAATAC AATCTACTTA TTCATGGTCA GAGCGATCAA CCCCAAGGTY 840 

TCAGTGACCC AAGTNAAACC ACAGAAAAAC AATGGATCCA CTTGGGCCAA TGTCCCTCTA 900 

CCTCCCCCCC CAGTCCAGCC CCTTCCTGGC ACGGAGCTGG AACACTATGC AGTGGAACAA 960 

CAAGAAAATG GCTATGACAG TGATAGCTGG TGCCCACCAT TGCCAGTACA AACTTACTTA 1020 

CACCAAGGTC TGGAAGATGA ACTGGAAGAA GATGATGATA GGGTCCCAAC ACCTCCTGTT 1080 

CGAGGCGTGG CTTCTTCTCC TGCTATCTCC TTTGGACAGC AGTCCACTGC AACTCTTACT 1140 

CCATCCCCAC GGGAAGAGAT GCAACCCATG CTGCAGGCTT CACCTNTTTA CCTCCTCTCA 1200 

AAGACCTCGA CCTACCAGCC CATTTTCTAC TGACAGTAAC ACCAGTGCAG CCCTGAGTCA 1260 

AAGTCAGAGG CCTCGGCCCA CTAAAAAACA CAAGGGAGGG 1300 


(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 285..396

(D) OTHER INFORMATION: /note= "Xaa signifies gap in sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gln Ile Val Ala Gln Gly Arg Thr Val Thr Phe Pro Cys Glu Thr Lys

1 5 10 15

Gly Asn Pro Gln Pro Ala Val Phe Trp Gln Lys Glu Gly Ser Gln Asn

20 25 30

Leu Leu Phe Pro Asn Gln Pro Gln Gln Pro Asn Ser Arg Cys Ser Val

35 40 45

Ser Pro Thr Gly Asp Leu Thr Ile Thr Asn Ile Gln Arg Ser Asp Ala

50 55 60

Gly Tyr Tyr Ile Cys Gln Ala Leu Thr Val Ala Gly Ser Ile Leu Ala
65 70 75 80

Lys Ala Gln Leu Glu Val Thr Asp Val Leu Thr Asp Arg Pro Pro Pro

85 90 95

Ile Ile Leu Gln Gly Pro Ala Asn Gln Thr Leu Ala Val Asp Gly Thr

100 105 110

Ala Leu Leu Lys Cys Lys Ala Thr Gly Asp Pro Leu Pro Val Ile Ser

115 120 125

Trp Leu Lys Glu Gly Phe Thr Phe Pro Gly Arg Asp Pro Arg Ala Thr

130 135 140

Ile Gln Glu Gln Gly Thr Leu Gln Ile Lys Asn Leu Arg Ile Ser Asp
145 150 155 160

Thr Gly Thr Tyr Thr Cys Val Ala Thr Ser Ser Ser Gly Glu Ala Ser

165 170 175

Trp Ser Ala Val Leu Asp Val Thr Glu Ser Gly Ala Thr Ile Ser Lys

180 185 190

Asn Tyr Asp Leu Ser Asp Leu Pro Gly Pro Pro Ser Lys Pro Gln Val

195 200 205

Thr Asp Val Thr Lys Asn Ser Val Thr Leu Ser Trp Gln Pro Gly Thr

210 215 220

Pro Gly Thr Leu Pro Ala Ser Ala Tyr Ile Ile Glu Ala Phe Ser Gln
225 230 235 240

Ser Val Ser Asn Ser Trp Gln Thr Val Ala Asn His Val Lys Thr Thr

245 250 255

Leu Tyr Thr Val Arg Gly Leu Arg Pro Asn Thr Ile Tyr Leu Phe Met

260 265 270

Val Arg Ala Ile Asn Pro Lys Val Ser Val Thr Gln Xaa Lys Pro Gln

275 280 285

Lys Asn Asn Gly Ser Thr Trp Ala Asn Val Pro Leu Pro Pro Pro Pro

290 295 300

Val Gln Pro Leu Pro Gly Thr Glu Leu Glu His Tyr Ala Val Glu Gln
305 310 315 320

Gln Glu Asn Gly Tyr Asp Ser Asp Ser Trp Cys Pro Pro Leu Pro Val

325 330 335

Gln Thr Tyr Leu His Gln Gly Leu Glu Asp Glu Leu Glu Glu Asp Asp

340 345 350

Asp Arg Val Pro Thr Pro Pro Val Arg Gly Val Ala Ser Ser Pro Ala

355 360 365

Ile Ser Phe Gly Gln Gln Ser Thr Ala Thr Leu Thr Pro Ser Pro Arg

370 375 380

Glu Glu Met Gln Pro Met Leu Gln Ala Ser Pro Xaa Phe Thr Ser Ser
385 390 395 400

Gln Arg Pro Arg Pro Thr Ser Pro Phe Ser Thr Asp Ser Asn Thr Ser

405 410 415

Ala Ala Leu Ser Gln Ser Gln Arg Pro Arg Pro Thr Lys Lys His Lys
420 425 430 

Gly Gly 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCCAGGCAG TTGCTGCAGC TGCGGAGTAT GCGGGCCTGA AAGTGGCTCG CCGCCAAATG 60

CAAGATGCTG CTGGCCGCCG CCACTTCCAT GCCTCTCAGT GCCCAAGGCC CACGAGTCCT 120

GTGTCCACAG ACAGCAACAT GAGTGCTGTT GTGATCCAGA AAGCCAGACC CGCCAAGAAG 180

CAGAAACACC AGCCAGGACA TCTGCGCAGG GAAGCCTACG CAGATGATCT TCCACCCCCT 240

CCAGTGCCAC CACCTGCTAT AAAATCGCCC ACTGTCCAGT CCAAGGCACA GCTGGAGGTA 300

CGGCCTGTCA TGGTGCCAAA ACTCGCGTCT ATAGAAGCAA GGACAGATAG ATCGTCAGAC 360

AGAAAAGGAG GCAGTTACAA GGGGAGAGAA GCTCTGGATG GAAGACAAGT CACTGACCTG 420

CGAACAAATC CAAGTGACCC CAGA 444 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Ala Gln Ala Val Ala Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala

1 5 10 15

Arg Arg Gln Met Gln Asp Ala Ala Gly Arg Arg His Phe His Ala Ser

20 25 30

Gln Cys Pro Arg Pro Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser

35 40 45

Ala Val Val Ile Gln Lys Ala Arg Pro Ala Lys Lys Gln Lys His Gln

50 55 60

Pro Gly His Leu Arg Arg Glu Ala Tyr Ala Asp Asp Leu Pro Pro Pro
65 70 75 80

Pro Val Pro Pro Pro Ala Ile Lys Ser Pro Thr Val Gln Ser Lys Ala

85 90 95

Gln Leu Glu Val Arg Pro Val Met Val Pro Lys Leu Ala Ser Ile Glu

100 105 110

Ala Arg Thr Asp Arg Ser Ser Asp Arg Lys Gly Gly Ser Tyr Lys Gly

115 120 125

Arg Glu Ala Leu Asp Gly Arg Gln Val Thr Asp Leu Arg Thr Asn Pro

130 135 140

Ser Asp Pro Arg
145


(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Met Phe Asn Arg Lys Thr Leu Leu Cys Thr Ile Leu Leu Val Leu Gln

1 5 10 15

Ala Val Ile Arg Ser Phe Cys Glu Asp Ala Ser Asn Leu Ala Pro Val

20 25 30

Ile Ile Glu His Pro Ile Asp Val Val Val Ser Arg Gly Ser Pro Ala

35 40 45

Thr Leu Asn Cys Gly Ala Lys Pro Ser Thr Ala Lys Ile Thr Trp Tyr

50 55 60

Lys Asp Gly Gln Pro Val Ile Thr Asn Lys Glu Gln Val Asn Ser His
65 70 75 80

Arg Ile Val Leu Asp Thr Gly Ser Leu Phe Leu Leu Lys Val Asn Ser

85 90 95

Gly Lys Asn Gly Lys Asp Ser Asp Ala Gly Ala Tyr Tyr Cys Val Ala

100 105 110

Ser Asn Glu His Gly Glu Val Lys Ser Asn Glu Gly Ser Leu Lys Leu

115 120 125

Ala Met Leu Arg Glu Asp Phe Arg Val Arg Pro Arg Thr Val Gln Ala

130 135 140

Leu Gly Gly Glu Met Ala Val Leu Glu Cys Ser Pro Pro Arg Gly Phe
145 150 155 160

Pro Glu Pro Val Val Ser Trp Arg Lys Asp Asp Lys Glu Leu Arg Ile

165 170 175

Gln Asp Met Pro Arg Tyr Thr Leu His Ser Asp Gly Asn Leu Ile Ile
180 185 190


Asp Pro Val Asp Arg Ser Asp Ser Gly Thr Tyr Gln Cys Val Ala Asn

195 200 205

Asn Met Val Gly Glu Arg Val Ser Asn Pro Ala Arg Leu Ser Val Phe

210 215 220

Glu Lys Pro Lys Phe Glu Gln Glu Pro Lys Asp Met Thr Val Asp Val
225 230 235 240

Gly Ala Ala Val Leu Phe Asp Cys Arg Val Thr Gly Asp Pro Gln Pro

245 250 255

Gln Ile Thr Trp Lys Arg Lys Asn Glu Pro Met Pro Val Thr Arg Ala

260 265 270

Tyr Ile Ala Lys Asp Asn Arg Gly Leu Arg Ile Glu Arg Val Gln Pro

275 280 285

Ser Asp Glu Gly Glu Tyr Val Cys Tyr Ala Arg Asn Pro Ala Gly Thr

290 295 300

Leu Glu Ala Ser Ala His Leu Arg Val Gln Ala Pro Pro Ser Phe Gln
305 310 315 320

Thr Lys Pro Ala Asp Gln Ser Val Pro Ala Gly Gly Thr Ala Thr Phe

325 330 335

Glu Cys Thr Leu Val Gly Gln Pro Ser Pro Ala Tyr Phe Trp Ser Lys

340 345 350

Glu Gly Gln Gln Asp Leu Leu Phe Pro Ser Tyr Val Ser Ala Asp Gly

355 360 365

Arg Thr Lys Val Ser Pro Thr Gly Thr Leu Thr Ile Glu Glu Val Arg

370 375 380

Gln Val Asp Glu Gly Ala Tyr Val Cys Ala Gly Met Asn Ser Ala Gly
385 390 395 400

Ser Ser Leu Ser Lys Ala Ala Leu Lys Val Thr Thr Lys Ala Val Thr

405 410 415

Gly Asn Thr Pro Ala Lys Pro Pro Pro Thr Ile Glu His Gly His Gln

420 425 430

Asn Gln Thr Leu Met Val Gly Ser Ser Ala Ile Leu Pro Cys Gln Ala

435 440 445

Ser Gly Lys Pro Thr Pro Gly Ile Ser Trp Leu Arg Asp Gly Leu Pro

450 455 460

Ile Asp Ile Thr Asp Ser Arg Ile Ser Gln His Ser Thr Gly Ser Leu
465 470 475 480

His Ile Ala Asp Leu Lys Lys Pro Asp Thr Gly Val Tyr Thr Cys Ile

485 490 495

Ala Lys Asn Glu Asp Gly Glu Ser Thr Trp Ser Ala Ser Leu Thr Val

500 505 510

Glu Asp His Thr Ser Asn Ala Gln Phe Val Arg Met Pro Asp Pro Ser

515 520 525

Asn Phe Pro Ser Ser Pro Thr Gln Pro Ile Ile Val Asn Val Thr Asp

530 535 540

Thr Glu Val Glu Leu His Trp Asn Ala Pro Ser Thr Ser Gly Ala Gly
545 550 555 560

Pro Ile Thr Gly Tyr Ile Ile Gln Tyr Tyr Ser Pro Asp Leu Gly Gln

565 570 575

Thr Trp Phe Asn Ile Pro Asp Tyr Val Ala Ser Thr Glu Tyr Arg Ile

580 585 590

Lys Gly Leu Lys Pro Ser His Ser Tyr Met Phe Val Ile Arg Ala Glu

595 600 605

Asn Glu Lys Gly Ile Gly Thr Pro Ser Val Ser Ser Ala Leu Val Thr

610 615 620

Thr Ser Lys Pro Ala Ala Gln Val Ala Leu Ser Asp Lys Asn Lys Met 
625 630 635 640 

Asp Met Ala Ile Ala Glu Lys Arg Leu Thr Ser Glu Gln Leu Ile Lys 

645 650 655 

Leu Glu Glu Val Lys Thr Ile Asn Ser Thr Ala Val Arg Leu Phe Trp 

660 665 670 

Lys Lys Arg Lys Leu Glu Glu Leu Ile Asp Gly Tyr Tyr Ile Lys Trp 

675 680 685 

Arg Gly Pro Pro Arg Thr Asn Asp Asn Gln Tyr Val Asn Val Thr Ser 

690 695 700 

Pro Ser Thr Glu Asn Tyr Val Val Ser Asn Leu Met Pro Phe Thr Asn 
705 710 715 720 

Tyr Glu Phe Phe Val Ile Pro Tyr His Ser Gly Val His Ser Ile His 

725 730 735 

Gly Ala Pro Ser Asn Ser Met Asp Val Leu Thr Ala Glu Ala Pro Pro 

740 745 750 

Ser Leu Pro Pro Glu Asp Val Arg Ile Arg Met Leu Asn Leu Thr Thr 

755 760 765 

Leu Arg Ile Ser Trp Lys Ala Pro Lys Ala Asp Gly Ile Asn Gly Ile 

770 775 780 

Leu Lys Gly Phe Gln Ile Val Ile Val Gly Gln Ala Pro Asn Asn Asn 
785 790 795 800 

Arg Asn Ile Thr Thr Asn Glu Arg Ala Ala Ser Val Thr Leu Phe His 

805 810 815 

Leu Val Thr Gly Met Thr Tyr Lys Ile Arg Val Ala Ala Arg Ser Asn 

820 825 830 

Gly Gly Val Gly Val Ser His Gly Thr Ser Glu Val Ile Met Asn Gln 

835 840 845 

Asp Thr Leu Glu Lys His Leu Ala Ala Gln Gln Glu Asn Glu Ser Phe 

850 855 860 

Leu Tyr Gly Leu Ile Asn Lys Ser His Val Pro Val Ile Val Ile Val 
865 870 875 880 

Ala Ile Leu Ile Ile Phe Val Val Ile Ile Ile Ala Tyr Cys Tyr Trp 

885 890 895 

Arg Asn Ser Arg Asn Ser Asp Gly Lys Asp Arg Ser Phe 
900 905 


